Method of determining tumor sensitivities to therapeutic drugs

ABSTRACT

The present invention provides a new method for predicting whether or not a tumor (cancer, sarcoma, melanoma, etc.) in a patient will respond to a specific drug known to inhibit an enzyme that contributes to the pathogenesis of that type of tumor. The method involves conducting a comparative analysis between the enzymatic site of a reference enzyme known to interact with the drug and the enzymatic site of a target enzyme produced by the tumor cells of the patient. Identification of differences between the alleles of the two enzymes thus provides a basis for determining whether or not the drug will interact with the target enzyme in the tumor cells of the patient so as to inhibit its contribution to the pathogenesis of the tumor.

CROSS-REFERENCE TO RELATED APPLICATION

[0001] This application claims priority to U.S. Provisional Patent Applications Nos. 60/387,406 and 60/387,370, both filed Jun. 10, 2002.

STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH OR DEVELOPMENT

[0002] This invention was developed in part using United States Government support awarded by the National Institute of Health under grant number RO1-AR43356. The United States Government has certain rights in this invention.

BACKGROUND OF THE INVENTION

[0003] The selection of drugs for use in the treatment of individual cancer patients is based largely on results obtained in previous clinical trials and physician experience. Unfortunately, a physician's experience and prior clinical data may not accurately reflect how a cancer patient will respond to a particular drug application.

[0004] Recent studies have attempted to develop in vitro drug sensitivity systems, commonly referred to as chemosensitivity tests, to predict whether a specific anti-cancer drug will be effective in fighting cancer in an individual patient. Such systems take advantage of recent advances in cancer research which now allow cancer cells from a patient to be grown in test tubes and exposed to various anti-cancer medications. The observed response of the cancer cells to the applied drugs may then provide insight as to whether a particular drug application will effectively treat the cancer.

[0005] Two approaches, the cell sensitivity assay and the human tumor colony-forming assay, have shown some promise but are not yet practical for general application. In the human tumor colony-forming assay, the patient's cancer cells are placed in test tubes where they are exposed to the anti-cancer drugs being screened. After a short period of exposure, the exposed cells are then placed in a growth medium to determine if they will grow. Little or no growth indicates that the applied drug has been effective and may serve as a likely candidate for treating the cancer in that patient. The cell sensitivity assay, on the other hand, grows the cells first and then tests them for their drug sensitivity.

[0006] Although both studies have shown some value in predicting drug effectiveness, each test continues to have several disadvantages that limit their routine use. For instance, cells from certain cancers, such as colon, breast and lung cancers, are hard to grow in vitro such that an application of either test is unlikely. In addition, both tests often require between 2 to 3 weeks to provide results as the cells require time to grow and proliferate. The tests are also not applicable to pharmaceuticals that require interaction with bodily chemicals not present in the in vitro setting, nor is it yet possible to accurately duplicate in the in vitro environment the drug concentrations and duration of exposure of the cancer cells to the drugs within the body. As a result, the cell sensitivity assay and the human tumor colony-forming assay are very rarely used.

[0007] What is needed is a new technique that can more rapidly and accurately predict whether or not a specific anti-cancer drug is likely to be effective in treating cancer. For example, the KIT protein, which is the tyrosine kinase receptor for stem cell factor (SCF) is encoded by the c-KIT proto-oncogene. KIT is essential for normal development of mast cells and certain types of gastrointestinal stromal cells in humans and other mammals. Mutations of the c-KIT gene, however, are responsible for various types of tumors, including Gastrointestinal Stromal Tumors (GISTs) believed to originate from different c-KIT mutations that cause spontaneous activation of the KIT tyrosine kinase. Sporadic Adult Human Mastocytosis (SAHM) is also caused by specific mutations in c-KIT codon 816, which constitutively activate the KIT kinase. Certain mast cell lines and canine mast cell tumors also express a variety of other activating c-KIT mutations.

[0008] Studies have shown that small molecules that inhibit mutant activated KIT effectively kill these cell lines and may be clinically effective against GISTs. Unfortunately, recent studies have shown that drugs known to be effective in treating various forms of GISTs and some mast cell tumor lines will not be effective in treating SAHM because of resistance of the particular SAHM mutant KIT protein to these same drugs. What is needed is a new technique that can more rapidly and accurately predict whether or not a specific drug known to inhibit one form of the tumor expressing activated KIT protein will also be able to effectively interact with and treat other tumors, such as Sporadic Adult Human Mastocytosis arising in individual patients.

BRIEF SUMMARY OF THE INVENTION

[0009] The present invention is summarized as a new method of predicting whether or not a tumor will respond to a specific drug which is known to inhibit an enzyme that contributes to the pathogenesis of that type of tumor. The method involves conducting a comparative analysis between the enzymatic site of a reference enzyme known to interact with the drug and the enzymatic site of the target enzyme produced by the tumor cells of the patient. Identification of differences between the alleles of the two enzymes thus provides a basis for determining whether or not the drug will interact with the target enzyme in the tumor cells of the patient so as to inhibit its contribution to the pathogenesis of the tumor.

[0010] The method includes the steps of obtaining a sample of tumor cells from a patient, determining the allele of the enzymatic site of the target enzyme to be tested for its ability to interact with a drug of interest, and comparing the allele of the target enzyme to a reference providing information regarding the allele of a reference enzyme known to interact with the drug of interest. The allele of the enzymatic site of the target enzyme is determined by analyzing a biomolecule from the tumor cells. The biomolecule may be either a nucleotide sequence encoding an enzymatic site of the target enzyme, a nucleotide sequence encoding a fragment of the enzymatic site of the target enzyme, a polypeptide fragment from the enzymatic site of the target enzyme, or the enzyme itself. The analysis of the biomolecule may be conducted using either a gene chip, a probe, monoclonal or polyclonal antibodies, or by sequencing the biomolecule or performing a nucleotide polymorphism analysis of the biomolecule.

[0011] To determine if the target enzyme is likely to interact or not interact with the drug of interest, the allele of the target enzyme is compared to a reference providing information regarding the allele of an enzymatic site of a reference enzyme known to interact with the drug of interest. The reference may be either a reference biomolecule, a reference database, or a control. The reference biomolecule may include either a polynucleotide encoding the enzymatic site of the reference enzyme, or a polynucleotide encoding a fragment of the enzymatic site of the reference enzyme, including a labeled or unlabelled probe or a gene chip designed to include polynucleotide fragments representing portions of the nucleotide sequence encoding the enzymatic site of the reference enzyme, or polypeptide fragments from the enzymatic site of the reference enzyme, the reference enzyme itself, or monoclonal or polyclonal antibodies having an antigen-binding region located within the enzymatic site of the reference enzyme. The reference database will generally include a selection of data containing information regarding the enzymatic site of the reference enzyme.

[0012] A finding that a target enzyme is likely to interact or to not interact with a drug of interest may be further enhanced when combined with evidence of expression and/or activation of the target enzyme using any one of a number of additional studies. An example of one such study may include the detection of specific enzyme activation by identification of the phosphorylation state of the target enzyme. A second additional study may include determining tumor drug sensitivity by direct testing the sensitivity of the target enzyme to specific drugs in vitro.

[0013] The present invention also includes a method for predicting whether or not a tumor expressing activated KIT will respond to a specific drug which is known to inhibit other KIT positive tumors. The method involves conducting a comparative analysis as described above between the enzymatic site of a c-KIT reference enzyme known to interact with the drug and the enzymatic site of the c-KIT enzyme produced by the tumor cells of the patient, referred to as the c-KIT target enzyme. Identification of differences between the alleles of the two enzymes thus provides a basis for determining whether or not the drug will effectively interact with the c-KIT target enzyme in the patient's tumor cells.

[0014] The present invention also includes a method for predicting whether or not a tumor expressing activated MEK will respond to a specific drug which is known to inhibit other MEK positive tumors. The method involves conducting a comparative analysis as described above between the enzymatic site of a Raf-1 reference enzyme known to interact with the drug and the enzymatic site of the Raf-1 enzyme produced by the tumor cells of the patient. Identification of differences between the alleles of the two enzymes thus provides a basis for determining whether or not the drug will effectively interact with the Raf-1 target enzyme in the patient's tumor cells.

[0015] The present invention also includes a method for predicting whether or not a tumor expressing the bcr-abl oncogene will respond to a specific drug which is known to inhibit other BCR-ABL positive tumors. The method involves conducting a comparative analysis as described above between the enzymatic site of a BCR-ABL reference enzyme known to interact with the drug and the enzymatic site of the BCR-ABL enzyme produced by the tumor cells of the patient. Identification of differences between the alleles of the two enzymes thus provides a basis for determining whether or not the drug will effectively interact with the BCR-ABL target enzyme in the patient's tumor cells.

[0016] The present invention also includes a method for predicting whether or not a tumor expressing the flt3 oncogene will respond to a specific drug which is known to inhibit other FLT3 positive tumors. The method involves conducting a comparative analysis as described above between the enzymatic site of a FLT3 reference enzyme known to interact with the drug and the enzymatic site of the FLT3 enzyme produced by the tumor cells of the patient. Identification of differences between the alleles of the two enzymes thus provides a basis for determining whether or not the drug will effectively interact with the FLT3 target enzyme in the patient's tumor cells.

[0017] The present invention also encompasses kits for carrying out the method of the present invention. The kit will generally comprise a research tool for use in determining the allele of the target enzyme and a reference for comparing the allele of the target enzyme against a reference enzyme known to interact with one or more drugs of interest. The research tool will generally include materials for isolating and/or analyzing the biomolecule from the tumor of the patient, such as a gene chip, labeled or unlabeled probes, monoclonal or polyclonal antibodies, or materials for sequencing the biomolecule or identifying nucleotide polymorphisms. The included reference will vary depending upon the type of tumor and/or drug under investigation, and will generally include either a reference biomolecule, a reference database or a control as described above.

[0018] One advantage of the present invention is that it increases the efficiency of in vitro testing because it does not require the demonstration of specific regulatory type mutations or events, which may be more difficult to demonstrate than enzymatic site type mutations. A second advantage is that it is also useful in cases of ligand induced activation, such as autocrine activation, since identification of this form of enzymatic activation may also be difficult. A third advantage is that it does not require in vitro culture of the patient's tumor cells with the associated complexity, expense and potential for bias. Other advantages will be readily discernable upon review of the detailed description and examples set forth below.

BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS

[0019]FIG. 1 illustrates differential sensitivity of “regulatory type” and “enzymatic pocket” type, and differentially substituted codon 816 mutant KIT to KIT inhibitors.

[0020]FIG. 2 illustrates differential sensitivity of V560G and D816V mutant KIT in HMC 1 subclones to KIT inhibitors.

[0021]FIG. 3 illustrates the induction of apoptosis in HMC1.1 but not in HMC1.2 cells by KIT inhibitors.

[0022]FIG. 4 illustrates differential effects of KIT inhibitors on the growth of HMC1.1 and HMC 1.2 cells.

DETAILED DESCRIPTION OF THE INVENTION

[0023] The present invention is a new method of predicting whether or not a tumor (cancer, sarcoma, melanoma, etc.) in a patient will respond to a specific drug known to inhibit similar tumors. The method involves conducting a comparative analysis between the enzymatic site of a reference enzyme known to interact with the drug and the enzymatic site of an analogous enzyme produced by the tumor cells of the patient. Identification of differences between the alleles of the two enzymes thus provides a basis for determining whether or not the drug will interact with the target enzyme in the tumor cells of the patient so as to inhibit its contribution to the pathogenesis of the tumor. The present invention also includes a method for predicting whether or not a tumor expressing activated KIT will respond to a specific drug which is known to inhibit other KIT positive tumors, as well as kits for practicing the present invention.

[0024] Mutations causing ligand independent constitutive phosphorylation and activation of certain enzymes have been shown to transform cell lines from factor-dependent growth to factor-independent growth in vitro and indolent tumors to aggressive tumors in vivo. It is recognized that different types of mutations can activate these enzymes through two different types of mechanisms. One type, called “enzymatic pocket” type mutations, includes mutations which affect the structure of the enzymatic site of the enzyme. For example, one such mutation may include the D816V substitution in KIT kinase. The D816V substitution is characteristic of adult human mastocytosis and affects the activation loop at the entrance to the enzymatic “pocket” formed by the split intracellular domain of KIT kinase. The second type, termed “regulatory type” mutations, does not affect the amino acids which directly form the enzymatic site, but instead affect portions of the molecule which regulate enzyme activity. For instance, an amphipathic alpha helix discovered in the intracellular juxtamembrane region of KIT suppresses phosphorylation and kinase activity in ligand unoccupied KIT, thereby regulating KIT activity. Mutations which disrupt this helix cause a release of inhibitory regulation, thus causing constitutive activation of KIT. Other potential mechanisms for regulatory type mutations could include effects on substrate access to the enzymatic site, effects on the binding of signal transducing or regulatory molecules to the enzyme polypeptide, or the induction of ligand independent dimerization with subsequent autophosphorylation and activation.

[0025] The distinction between regulatory and enzymatic pocket type mutations has therapeutic implications. Data now shows that molecules (i.e., therapeutic drugs) which bind to and obstruct the function of the wild type enzymatic site will not only effectively inhibit ligand-induced activation but may also inhibit activation by mutations affecting intra-molecular regulation of kinase activity (regulatory type mutations). However, the same drug may not bind at all to the altered enzymatic site of an enzyme bearing an enzymatic pocket mutation. Conversely, inhibitors which bind to an altered enzymatic pocket might not bind effectively to or inhibit the wild type enzymatic site. Thus, a kinase inhibitor which binds to the wild type enzymatic pocket would be predicted to also block activation by juxtamembrane regulatory region type activating mutations and could therefore be used to treat tumors expressing such mutations, but might not be effective at treating tumors expressing activation loop mutations.

[0026] The present invention provides a method for determining whether a drug of interest will interact with a target enzyme in the tumor cells of a patient. The method is performed by first obtaining a sample of tumor cells from the patient. The manner in which the tumor cells are collected will depend primarily on the type of tumor under consideration. For example, tumor cells from various types of lymphomas, carcinomas and sarcomas may be obtained by conducting a biopsy of the tumor itself, while tumor cells from various types of leukemia may be obtained by collecting a sample of the patient's blood or bone marrow. In the case of familial tumors caused by inherited polymorphisms, non-tumor DNA such as DNA extracted from buccal mucosal epithelial cells or peripheral blood lymphocytes may be used.

[0027] The tumor cells are analyzed to determine the allele of the enzymatic site of the target enzyme. As used herein, the allele of the enzymatic site of the target enzyme refers to the physical characteristics of the enzymatic site arising from the gene encoding the enzyme. The allele of the enzymatic site may be determined using any technique known in the art for identifying the physical characteristics of gene segments and functional proteins. In general, the allele will be determined by analyzing a biomolecule associated with the enzymatic site of the target enzyme. The biomolecule will typically include either a nucleotide sequence of RNA or DNA that encodes at least a portion of the enzymatic site of the enzyme, or a polypeptide fragment from the enzymatic site of the target enzyme, or the target enzyme itself.

[0028] The present invention also provides a method for determining whether a drug of interest will interact with a c-KIT target enzyme in the tumor cells of a patient. The method is performed by first obtaining a sample of tumor cells from the patient. The tumor cells are then analyzed to determine the allele of the enzymatic site of the c-KIT target enzyme. As used herein, the allele of the enzymatic site of the c-KIT target enzyme refers to the physical characteristics of the enzymatic site arising from the gene encoding the c-KIT target enzyme. The allele of the c-KIT target enzyme may be determined using any technique known in the art for identifying the physical characteristics of gene segments and functional proteins. In general, the allele will be determined by analyzing a c-KIT biomolecule associated with the enzymatic site of the c-KIT target enzyme. The c-KIT biomolecule will typically include either a nucleotide sequence of RNA or DNA that encodes at least a portion of the enzymatic site of the c-KIT target enzyme, or a polypeptide fragment from the enzymatic site of the c-KIT target enzyme, or the c-KIT target enzyme itself.

[0029] In one embodiment, the tumor cells are processed in order to analyze the enzymatic site of the target enzyme by determining the amino acid structure forming the target enzyme's enzymatic site. The amino acid structure may be deduced using any one of the many techniques known in the art for deducing amino acid structures, including without limitation the use of monoclonal antibodies, polyclonal antibodies, or by sequencing the isolated polypeptide or the underlying nucleotide sequence encoding the polypeptide.

[0030] For example, monoclonal antibodies may be produced having an antigen-binding region specific to a binding region within the enzymatic site of the target enzyme or a c-KIT reference enzyme as discussed below. Monoclonal antibodies can be produced using well-established hybridoma techniques first introduced by Kohler and Milstein (see, Kohler and Milstein, “Continuous Cultures of Fused Cells Secreting Antibody of Pre-Defined Specificity”, Nature, 256:495-97 (1975)). These techniques involve the injection of an immunogen (e.g., cells or cellular extracts carrying the antigen or purified antigen) into an animal (e.g., mouse) so as to elicit a desired immune response in that animal. After a sufficient time, antibody-producing lymphocytes are obtained from the animal either from the spleen, lymph nodes or peripheral blood. Preferably, lymphocytes are obtained from the spleen. The splenic lymphocytes are then fused with a myeloma cell line, usually in the presence of a fusing agents such as polyethylene glycol (PEG). Any number of myeloma cell lines may be used as a fusion partner according to standard techniques. For example, one such myeloma cell line may include Sp2/0-Ag14 myeloma, non-secreting, mouse cell line (ATCC CRL 1581).

[0031] The resulting cells, which include the desired hybridomas, are then grown in a selective medium, such as HAT medium. In this medium, only successfully fused hybridoma cells survive while unfused parental myeloma or lymphocyte cells die. The surviving cells are then grown under limiting conditions to obtain isolated clones and their supernatants screened for the presence of antibodies having a desired specificity. Positive clones may then be subcloned under limiting dilution conditions and the desired monoclonal antibodies isolated. Hybridomas produced according to these methods can be propagated in vitro or in vivo (in ascites fluid) and purified using common techniques known in the art. Methods for purifying monoclonal antibodies include ammonium sulfate precipitation, ion exchange chromatography, and affinity chromatography (see, e.g., Zola et al., “Techniques for the Production and Characterization of Monoclonal Hybridoma Antibodies”, in Monoclonal Hybridoma Antibodies: Techniques and Applications, pp. 51-52 (Hurell, ed., CRC Press, 1982)).

[0032] Once purified monoclonal antibodies are obtained, epitope mapping may be performed to determine which peptide segment (or antigen-binding region) of the target enzyme is recognized by each particular antibody. The purpose for the epitope mapping is to have a well characterized monoclonal antibody. Ideally, monoclonal antibodies with different specificity to the same target enzyme can be prepared so that researchers have probes for different parts of the target enzyme under investigation. Monoclonal antibodies may also be prepared to provide probes that mimic the binding affinity of various drugs under investigation.

[0033] In a second embodiment, the tumor cells are processed in order to obtain at least a portion of the nucleotide sequence of the enzymatic site of the target enzyme. The nucleotide sequence will be either RNA or DNA (including directly isolated, copied or genomic forms) and may include either polynucleotides encoding a portion of the enzymatic site, polynucleotides encoding the entire enzymatic site, or polynucleotides encoding the entire target enzyme. Such polynucleotides may be either isolated and purified from the other biological materials derived from the sample of tumor cells, or maintained in combination with the other biological materials.

[0034] The manner in which the polynucleotides are maintained will depend primarily upon the techniques used to deduce their nucleotide sequence to determine the allele of the enzymatic site for which they encode. The polynucleotides may be analyzed using any one of the many techniques known in the art for discerning the nucleotide sequence of polynucleotide material. For example, the polynucleotide may be subjected to basic sequencing techniques as are well known in the art. The polynucleotide may also be placed in direct contact with labeled or unlabelled polynucleotide probes designed to deduce the sequence of the polynucleotide or to identify the presence of specific nucleotide sequences. The polynucleotide may also be subjected to nuclear polymorphism analysis using technology such as the Invader technology presently provided by Third Wave Technology, Inc. The polynucleotide may also be placed in contact with a gene chip (microarray) including polynucleotide fragments designed to deduce the sequence of or presence of specific nucleotide sequences. Gene chips and labeled probes may also be used as a platform for performing a direct comparison of the target enzyme's enzymatic site with the enzymatic site of a reference biomolecule known to interact with the drug of interest.

[0035] To determine whether or not a drug of interest will interact with the enzymatic site of the target enzyme in the tumor of the patient, the allele of the enzymatic site of the target enzyme must be compared to a reference providing information regarding the allele of the enzymatic site of a reference enzyme known to interact with the drug of interest. The reference may include either a reference biomolecule, a reference data base or a control, depending upon the techniques utilized in practicing the present invention.

[0036] The reference biomolecule will generally include a biological molecule representing the whole or a portion of the physical structure of the enzymatic site of the reference enzyme. For example, the reference biomolecule may include a polynucleotide encoding the enzymatic site of the reference enzyme, or a polynucleotide encoding a fragment of the enzymatic site of the reference enzyme, including a labeled or unlabelled probe or a gene chip designed to include polynucleotide fragments representing portions of the nucleotide sequence encoding the enzymatic site of the reference enzyme. The reference biomolecule may also include polypeptide fragments from the enzymatic site of the reference enzyme, the reference enzyme itself, or monoclonal or polyclonal antibodies having an antigen-binding region located within the enzymatic site of the reference enzyme.

[0037] The reference database will generally include a selection of data containing information regarding the allele of the enzymatic site of the reference enzyme. For example, the database may include a selection of alleles known to interact, or to not interact, with the drug of interest. The database may also include a listing of several drugs and the alleles for which they have shown the ability to interact or to not interact. The database may also include a listing of the nucleotide sequence or amino acid sequence of those enzymatic sites known to interact, or to not interact, with one or more types of anti-cancer drugs. The database may also include a listing of specific probes and antibodies associated with enzymatic sites known to interact, or to not interact, with one or more such anti-cancer drugs.

[0038] The allele of the enzymatic site of the target enzyme is compared to the reference in order to determine if there exists mutations causing amino acid substitutions, deletions, insertions or inversions at the enzymatic site which cause a change in the ability of the enzymatic site to interact with the drug of interest. For example, the allele of the enzymatic site of the target enzyme could be analyzed to determine the existence of a mutation that causes an amino acid substitution which results in a change in the size or polarity of a segment of the enzyme, or a change in the nature of the side chains of the amino acid(s) forming the enzymatic site. Such an analysis may be performed by simply comparing the nucleotide sequence or amino acid sequence of the enzymatic site of the target enzyme with the nucleotide sequence or amino acid sequence of the enzymatic site of the reference enzyme. The analysis may also be performed using either probes, or monoclonal or polyclonal antibodies, or nucleotide polymorphism techniques as described above. It is also envisioned that other techniques not described herein but well known to those skilled in the art may also be employed to detect differences between the enzymatic sites of the target enzyme and the reference enzyme.

[0039] A finding that a target enzyme is likely to interact or to not interact with a drug of interest may be further enhanced when combined with evidence of expression and/or activation of the target enzyme, using any one of a number of additional studies. An example of one such study may include the detection of specific enzyme activation by identification of the phosphorylation state of the target enzyme. This can be done either in situ on sections of tumor by histochemical or other means, or in vitro, for instance by western blotting or enzyme or ligand/receptor linked immunologic analysis (ELISA) of target enzyme extracted from the tumor.

[0040] A second additional study may also include determining tumor drug sensitivity by direct testing the sensitivity of the target enzyme to specific drugs in vitro. This may be done, for instance, by expression of the target enzyme by transient transfection and determination of the drug sensitivity of the expressed enzyme by western blotting. Because it is more labor intensive and therefore more expensive, this additional component would normally only be used if the target enzyme was shown to be mutated in a way that directly altered its enzymatic site (i.e., if the first tests(s) in this approach showed that the enzymatic site of the target enzyme differed from that of the reference enzyme).

[0041] The present invention differs from previous approaches to determining tumor sensitivity by its unique focus on the primary sequence of the enzymatic site, and depends on the classification system of regulatory type mutations and enzymatic site (enzymatic pocket) mutations as described by Longley et al., “Classes of c-KIT Activating Mutations: Proposed Mechanisms of Action and Implications for Disease Classification and Therapy”, Leukemia Research, (Vienna Mastocytosis Conference, July 2001). This classification system states that oncongenic enzymes that are activated by mutations which affect the enzymatic site of the molecule are more likely to show different drug inhibition profiles than enzymes that are activated by their ligands or by regulatory type mutations/events which do not affect the primary structure of the enzymatic site.

[0042] As a result, the present invention increases the efficiency of in vitro testing because it does not require the demonstration of specific regulatory type mutations or events, which may be more difficult to demonstrate than enzymatic site type mutations. It is also useful in cases of ligand induced activation such as autocrine activation, since identification of this form of enzymatic activation may also be difficult.

[0043] To facilitate in vitro testing, the present invention also encompasses kits for carrying out the method of the present invention. The kit will generally comprise a research tool for use in determining the allele of the target enzyme and a reference for comparing the allele of the target enzyme against a reference enzyme known to interact with one or more drugs of interset. The research tool will generally include materials for isolating and/or analyzing the biomolecule from the tumor of the patient, such as a gene chip, labeled or unlabeled probes, monoclonal or polyclonal antibodies, or materials for sequencing the biomolecule or identifying nucleotide polymorphisms. Kits may be developed for specific application in analyzing various different types of tumors and/or drugs, and will generally vary depending upon the type of reference or research material included. The included reference will of course vary depending upon the type of tumor and/or drug under investigation, and will generally include either a reference biomolecule, a reference database or a control as described above. Ancillary reagents may also be included to facilitate the testing.

[0044] The below examples are merely intended to illustrate the method of the present invention and should not be construed to limited the scope or the spirit of the invention. This invention is not limited to the preferred embodiments and alternatives heretofore described, to which variations and improvements may be made.

EXAMPLES Example 1

[0045] Two subclones of the human mast cell leukemia line HMC1 were used to determine if the wild type and mutant KIT enzymes were able to interact with 2 drug compounds, STI517 and SU9529. Set forth as SEQ ID NO:1 is a nucleotide sequence for the wild type KIT enzyme (amino acid sequence is set forth as SEQ ID NO:2). One subclone, HMC1.1, expressed a valine to glycine substitution in codon 560 (Val560Gly) of the KIT intracellular juxtamembrane region. The other, HMC1.2, expressed the Val560Gly mutation and a second substitution, aspartate to valine in codon 816 (Asp816Val) in the kinase enzymatic pocket. The 2 mutations were originally found together in the HMC1 cell line and were shown to cause SCF-independent constitutive activation of KIT by Furitsu et al., “Identification of mutations in the coding sequence of the proto-oncogene c-KIT in a human mast cell leukemia cell line causing ligand-independent activation of c-KIT product,” J. Clin. Invest., 92:1736-44 (1993). For additional study of wild type and specific mutant forms of KIT, COS cells were used for transient expression of human wild-type and mutated c-KIT complementary DNAs, as described previously by Ma et al., “Inhibition of spontaneous receptor phosphorylation by residues in putatitive α-helix in the KIT intracellular juxtamembrane region,” J. Biol. Chem., 274:13399-13402 (1999). ST1571 is a commercially available drug manufactured by Novartis and developed as an inhibitor of the c-abl gene product. ST1571 has been reported to inhibit wild-type KIT and KIT expressed by HMC1. The other compound, SU9529, is manufactured by SUGEN and has not yet been made commercially available, although it has been published.

[0046] Cells were serum-starved overnight, incubated with or without inhibitors for 1 hour and with or without SCF (200 μg/mL, 10 minutes), followed by immunoprecipitation of cell lysates with anti-KIT antibodies (provided by Dr Keith Langley of Amgen, Thousand Oaks, Calif.), fractionation of proteins by sodium dodecyl sulfate-polyacrylamide gel electrophoresis, and immunoblotting with antiphophotyrosine antibody (Upstate Biotechnology, Lake Placid, N.Y.). Blots were stripped and re-probed with anti-KIT antibody as described previously. Ma et al., supra.

[0047] Cells cultured in the presence or absence of inhibitors were counted daily in a hemocytometer using trypan blue exclusion. Proliferation assays were repeated at least 3 times. Apoptosis was examined using a DNA fragmentation assay. Briefly, cells were grown in the presence or absence of inhibitors, genomic DNA was isolated and separated by agarose gel electrophoresis, and DNA fragments were visualized under UV light by ethidium bromide staining.

[0048] It was discovered that both ST1571 and SU9529 prevent the phosphorylation of wild-type KIT induced by its natural ligand SCF and inhibit SCF-independent constitutive phosphorylation of KIT caused by the Val560Gly juxtamembrane regulatory type (RT) activating mutation at 0.1 to 1 μM (FIG. 1). The antiphosphotyrosine blots of immunoprecipitated KIT expressed in the COS cells exhibited a low level of spontaneous phosphorylation of wild-type (WT) KIT (FIG. 1, lane 1), which increased in response to SCF stimulation (FIG. 1, lane 2). Both inhibitors at 0.1 to 1 μM prevented this ligand-induces phosphorylation (FIG. 1, lanes 3-4). RT mutant KIT with Val560Gly substitution showed a high level of SCF-independent phosphorylation (FIG. 1, lane 5), and the phosphorylation was inhibited by both inhibitors at physiologically achievable (0.1-1 μM) concentrations (FIG. 1, lanes 6-7).

[0049] In contrast, both inhibitors failed to inhibit SCF-independent constitutive phosphorylation of KIT containing the Asp816Val enzymatic pocket-type mutation associated with adult human mastocytosis even at 10 μM (FIG. 1). The enzymatic site type (EST) Asp816Val mutation commonly found in human mastocytosis exhibited high spontaneous phosphorylation (FIG. 1, lane 8) but was resistant to inhibition by either KIT inhibitor at 1 to 10 μM (FIG. 1, lanes 9-10). Substitution of phenylalanine or tyrosine for aspartate 816, rarely found in human mastocytosis, also resulted in high spontaneous phosphorylation (FIG. 1, lanes 11 and 14), but these 2 mutant variants responded to the inhibitors at 1 to 10 μM (FIG. 1, lanes 12-13 and 15-16) unlike Asp816Val KIT. However, their response was still an order of magnitude less sensitive than the regulatory type mutant of the wild-type KIT and are not considered valid therapeutic targets with currently available drugs.

[0050] Similarly, both drugs inhibited spontaneous KIT phosphorylation in the HMC 1.1 subclone, which expresses the Val560Gly activating mutation, but at 10 μM they failed to inhibit spontaneous phosphorylation of KIT in the HMC 1.2 subclone, which expresses both the Val560Gly and Asp816Val activating mutations. As illustrated in FIG. 2, antiphosphotyrosine blots of immunoprecipitated KIT expressed in 2 HMC1 subclones showed that spontaneous phosphorylation of KIT containing only the juxtamembrane RT Val560Gly mutation in the HMC 1.1 clone is susceptible to inhibition by both inhibitors at 0.1 to 1 μM (FIG. 2, lanes 1-3). In contrast, spontaneous phosphorylation of KIT with both the Val560Gly the EST Asp816Val mutation in the HMC1.2 clone was resistant to both inhibitors at 1 to 1.0 μM (FIG. 2, lanes 4-6).

[0051] As would be predicted if the activating mutations caused the proliferation of the mast cells and were necessary for their survival, both inhibitors induced apoptosis of the HMC 1.1 cells, causing the death of this line, but failed to kill the HMC1.2 cells (FIGS. 3 and 4). As depicted in FIG. 3, DNA fragmentation assays showed that only the HMC1.1 clone expressing only the Val560Gly juxtamembrane RT mutation underwent apoptosis, as indicated by formation of DNA “ladders” in the presence of the inhibitors at 0.1 to 1 μM (FIG. 3, lanes 2-3). In contrast, the HMC1.2 clone expressing both the Val560Gly mutation and the Asp816Val EST mutation does not exhibit any significant DNA “ladder” in the presences of the inhibitors at 0.1 to 1 μM (FIG. 3, lanes 5-6). As illustrated in FIG. 4, cell proliferation assays showed that incubation of HMC1.1 cells with inhibitors at 0.1 to 1 μM for a 3-day period kills the cells, while treatment of the HMC 1.2 cells with the inhibitors at 1 to 10 μM only slightly inhibits growth of the cells.

[0052] Two rare cases of human mastocytosis have been previously described in which other amino acids besides valine are substituted for Asp816 in the enzymatic pocket of the KIT kinase. These 2 variants, involving substitution of either tyrosine or phenylalanine, also cause SCF-independent constitutive phosphorylation of KIT. The 2 mutants, however, were partially inhibited by the KIT inhibitors at 1 to 10 μM (FIG. 1, lanes 11-16), concentrations which are totally ineffective against the most common Asp816Val substituted mutant (FIG. 1, lanes 8-10). Both of these variant enzymatic site type (EST) mutant KITs are an order of magnitude less sensitive than the wild-type or regulatory type (RT) mutant KIT, and neither of these inhibitors appeared to have a high enough therapeutic index to be valid candidates for inhibiting the mutant kinases in a clinical trial. Nevertheless, the data does show that KIT kinases with different residue substitutions in codon 816 of the enzymatic pocket are likely to exhibit differences in susceptibility to specific pharmacologic inhibitors.

[0053] The data unequivocally showed that different classes of activating KIT mutations responded differentially to different inhibitors. The data extends previous studies of nonhuman mammalian KIT-activating mutations to the actual mutations found in various forms of human disease and supports the proposals for classification of mutations as either “regulatory type” or “enzymatic site type” mutations, and for classification of human diseases according to the type of the mutations expressed in specific tumors.

[0054] Current results also highlight the need to identify specific variants of mutant KIT expressed by individual patients when one contemplates rational therapy. The finding that different amino acid substitutions in codon 816 of the enzymatic pocket represents the first documentation with human mutant activated KIT to support the individualization of drug therapy based on the response of specific mutant proteins to specific drugs. Accordingly, the data suggests that despite the previously reported ability of the KIT inhibitor STI571 to kill an HMC1 line at 1 μM, currently available KIT inhibitors may be ineffective in treating human adult-type mastocytosis. On the other hand, neoplastic processes characterized by RT KIT-activating mutations, such as gastrointestinal stromal tumors, should be susceptible to inhibition by a relatively wide variety of inhibitors, including those that inhibit wild-type KIT.

[0055] Different RT mutations, in a given species, show similar sensitivities regardless of the specific amino acid substitution 10 (additional data not shown). This observation supports the concept that the enzymatic pocket in the RT mutants does not differ significantly from the enzymatic pocket of wild-type KIT. It follows that a drug that is a “good fit” for the wild-type enzymatic pocket and is capable of sterically blocking the enzymatic reaction would be likely to be also effective against a RT mutant but would not necessarily be effective against an enzymatic pocket-type mutant. This concept may aid in identifying potentially clinically useful drugs.

[0056] The data presented also sheds a unique perspective on the cause of mast cell neoplasms. The fact that the HMC1.1 and 1.2 cell lines are only known to differ by the presence or absence of the Asp816Val mutation makes them an ideal model for determining the role of KIT activation in the factor-independent growth and survival of these cells. The key observation is the fact that both drugs inhibit the RT activating KIT mutation, and both drugs are capable of killing the HMC1.2 cell line, which expresses only that mutation. However, neither of these drugs are capable of inhibiting the enzymatic pocket-type mutation found in the HMC 1.1 clone, and neither are capable of inhibiting the growth and survival of that cell line. Together, these observations show that the ability to kill neoplastic mast cells expressing activated KIT is associated with the ability to inhibit the mutated, activated KIT rather than to inhibit some other unknown kinases.

Example 2 (Prophetic)

[0057] Raf-1 (nucleotide and amino acid sequences are set forth as SEQ ID NO:3 and SEQ ID NO:4, respectively) is a serine-threonine protein kinase that, when activated, phosphorylates and activates mitogen activated protein kinase/extracellular signal related kinase (ERK) kinases (MEK1 and MEK2), stimulating a cascade of intracellular signaling that can influence cell growth and differentiation, and promote neoplastic transformation. Induction of a mutation in codon 361 (G361 S), which causes substitution of the third glycine of the GXGXXG ATP binding motif characteristic of kinase enzymatic sites, results in constitutive activation of MEK and promotes transformation in vitro (Chan et al, “Mutations in conserved regions 1, 2, and 3 of Raf-1 that activate transforming activity,” Molecular Carcinogenesis, 33:189-197 (2002)). Although this mutation has not been described in nature, it is anticipated that its occurrence may contribute to aggressive behavior of a tumor expressing the mutation and would, therefore, represent a potential therapeutic target. However, it is postulated that ATP analogue drugs that bind to the wild type Raf-1 may not also bind to the altered enzymatic site in this enzyme. As a result, such a drug may not be useful as a first line defense against the tumor.

[0058] To predict if a particular ATP analogue drug will be effective in treating the tumor of a patient, a biopsy of the tumor is performed. The tumor cells are isolated from the sample and processed in order to recover the cells DNA. The nucleotides encoding the enzymatic site are sequenced and compared to that of a reference Raf-1 enzyme. If the nucleotide sequence of the target Raf-1 enzyme includes a mutation that is known to alter the enzymatic site in a manner that precludes the ATP analogue drug from binding to the enzymatic site, the drug is removed from consideration as a possible candidate for use as a therapeutic drug in that patient. If the nucleotide sequence of the target Raf-1 enzyme includes a mutation that has yet to be defined as inhibitory or permissive with respect to binding of the ATP analogue drug, the target Raf-1 enzyme is subjected to further testing to determine its sensitivity to the ATP analogue drug.

[0059] Alternatively, using a labeled probe designed to recognize and bind the Raf-1 protein kinase the Raf-1 target enzyme from the patient is isolated and its enzymatic site sequenced and compared to wild type Raf-1 known to have its activity altered by the ATP analogue drug. If the nucleotide sequence of both enzymes are identical or similar to the extent that there exists no change in the amino acid structure of the enzymatic site, the drug is considered to be a candidate for use as a therapeutic drug for treating the tumor.

[0060] Additional sensitivity testing is conducted by causing the expression of the target Raf-1 enzyme by transient transfection and then measuring the sensitivity of the target Raf-1 enzyme to the ATP analogue drug through western blotting. Positive sensitivity results indicate that although the nucleotide sequences of the target Raf-1 enzyme and the wild-type Raf-1 enzyme differ, the mutation does not alter the enzymatic site in a manner such that it inhibits the ability of the ATP analogue drug to alter the activity of the target Raf-1 enzyme. As a result, the ATP analogue drug may be considered to be a candidate for use as a therapeutic drug for treating the tumor.

[0061] Example 3 (Prophetic)

[0062] Chronic Mylogenic Leukemia (CML) typically arises from a reciprocal translocation between one chromosome 9 and one chromosome 22 of the patient. The translocation results in the modified chromosome 9 being longer than normal and the modified chromosome 22 (referred to as the Philadelphia chromosome) being shorter than normal. The DNA removed from chromosome 9 contains most of the proto-oncogene designated c-ABL, while the break in chromosome 22 occurs in the middle of the gene designated BCR. The resulting Philadelphia chromosome has the 5′ section of BCR fused with most of the c-ABL oncogene. Transcription and translation of the hybrid BCR-ABL gene produces an abnormal (“fusion”) protein that activates constitutively a number of cell activities that normally are turned on only when the cell is stimulated by a growth factor, such as platelet-derived growth factor (PDGF).

[0063] The BCR-ABL fusion protein is a tyrosine kinase having its enzymatic site derived from the c-ABL portion. Set forth as SEQ ID NO:5 is the typical nucleotide sequence for the c-ABL portion (amino acid sequence is set forth as SEQ ID NO:6). Mutations to the enzymatic site have been identified, and are believed to be associated with decreased sensitivity to the drug Imatinab. Using the method of the present invention, a preliminary determination can be made as to whether Imatinab, or some other cancer drug, may be effective in treating a patient with chronic leukemia by determining the presence or absence of these mutations.

[0064] As described in the examples above, a biopsy of the tumor is performed, with the tumor cells isolated and processed in order to recover their DNA. The nucleotides encoding the enzymatic site are then sequenced and compared to that of a reference BCR-ABL enzyme using typical sequencing techniques or probes (whether by microarray or other method) designed to detect variations in the amino acids or nucleotide sequence associated with the BCR-ABL enzymatic site. If the nucleotide sequence of the target BCR-ABL enzyme includes a mutation that is known to alter the enzymatic site in a manner that precludes the BCR-ABL analogue drug from binding to the enzymatic site (such as mutations to amino acids 244-255 or 311-359 forming the sides of the enzymatic pocket, or amino acids 381-402 forming the enzymatic pocket's activation loop), the drug is removed from consideration as a possible candidate. If the nucleotide sequence includes a mutation that has yet to be defined as inhibitory or permissive with respect to the ability of the BCR-ABL analogue drug to bind, the target BCR-ABL enzyme is subjected to further testing to determine its sensitivity to the analogue drug.

Example 4 (Prophetic)

[0065] The FMS-like tyrosine kinase 3 (FLT3) is a receptor tyrosine kinase that is expressed on early hematopoetic and lymphoid precursor cells, and plays an important role in cell survival and differentiation. Set forth as SEQ ID NO:7 is a nucleotide sequence for the wild-type flt3 gene (amino acid sequence is set forth as SEQ ID NO:8). FLT3 may be activated by mutations called internal tandem duplications (ITDs) that affect the intracellular juxtamembrane of FLT3 (Kiyio et al., “Internal tandem duplication of the FLT3 gene is a novel modality of elongation mutation which causes constitutive activation of the product,” Leukemia 12:1333-37 (1998)), and by point mutations affecting the tyrosine kinase domain, particularly those in codon 835 and 836, which encode amino acids helping to form the enzymatic site of the FLT3 kinase (Yamamoto et al., “Activating mutation of D835 within the activation loop of FLT3 in human hematologic malignancies,” Blood, 97:2434-39 (2001)). FLT3 is expressed by the leukemic cells of most patients with Acute Myeloid Leukemia (AML). FLT3 activating mutations are found in approximately 20% of AML Patients, and their presence is associated with a poorer prognosis (Thiede et al., “Analysis of FLT3-activating mutations in 979 patients with acute myelogenous leukemia: Association with FAB subtypes and identification of subgroups with poor prognosis,” Blood, 99(12):4326-35 (2002)), an observation which suggests that activated FLT3 contributes to disease progression and would be a valid therapeutic target in AML patients.

[0066] To predict if a particular drug will be effective in treating a particular FLT3-type AML patient, a sample of the leukemic cells from the patient are collected and processed in order to recover the cells DNA. The DNA is then exposed to a gene chip designed to include individual nucleotide probes representing various polynucleotide sequences found in the enzymatic site of various FLT3 enzymes known to either interact with or to not interact with the particular drug under investigation. Depending upon the resulting binding pattern on the gene chip, the drug is determined to be either a candidate or not a candidate for use as a therapeutic drug for the patient.

[0067] The Longley model of activating mutations predicts that drugs that are ATP competitive inhibitors of wild type FLT3 will be effective inhibitors of FLT3 kinase activated by its natural ligand or by regulatory mutations such as those affecting its expression levels or those affecting the intra-molecular regulation of the kinase, typified by the juxtamembrane region ITDs. Inhibitors effective against wild-type FLT3 are, therefore, candidates for treatment of AML in patients whose leukemic cells express FLT3 activated via these mechanisms. However, patients whose leukemic cells express FLT3 activated by mutations affecting the enzymatic site of FLT3, such as those in codons 835 or 836, are more likely to be resistant to such inhibitors, and not a candidate for treatment with the drug absent further testing.

[0068] Cells expressing FLT3 activation by mutations affecting the enzymatic site of FLT3 are subjected to additional testing to determine if the altered FLT3 enzymatic site might represent a special therapeutic target found only in leukemic cells. Drug sensitivity testing is conducted by causing the expression of the altered FLT3 enzyme by transient transfection and then measuring the sensitivity of the altered FLT3 enzyme to the drug through western blotting. If the drug only inhibited the mutant enzymatic site and not the wild type enzyme, it may be a candidate for killing leukemic cells with minimal side effects.

1 8 1 2931 DNA Homo sapiens CDS (1)..(2928) misc_structure (1668)..(1670) codon 560 that can mutate to encode Glycine 1 atg aga ggc gct cgc ggc gcc tgg gat ttt ctc tgc gtt ctg ctc cta 48 Met Arg Gly Ala Arg Gly Ala Trp Asp Phe Leu Cys Val Leu Leu Leu 1 5 10 15 ctg ctt cgc gtc cag aca ggc tct tct caa cca tct gtg agt cca ggg 96 Leu Leu Arg Val Gln Thr Gly Ser Ser Gln Pro Ser Val Ser Pro Gly 20 25 30 gaa ccg tct cca cca tcc atc cat cca gga aaa tca gac tta ata gtc 144 Glu Pro Ser Pro Pro Ser Ile His Pro Gly Lys Ser Asp Leu Ile Val 35 40 45 cgc gtg ggc gac gag att agg ctg tta tgc act gat ccg ggc ttt gtc 192 Arg Val Gly Asp Glu Ile Arg Leu Leu Cys Thr Asp Pro Gly Phe Val 50 55 60 aaa tgg act ttt gag atc ctg gat gaa acg aat gag aat aag cag aat 240 Lys Trp Thr Phe Glu Ile Leu Asp Glu Thr Asn Glu Asn Lys Gln Asn 65 70 75 80 gaa tgg atc acg gaa aag gca gaa gcc acc aac acc ggc aaa tac acg 288 Glu Trp Ile Thr Glu Lys Ala Glu Ala Thr Asn Thr Gly Lys Tyr Thr 85 90 95 tgc acc aac aaa cac ggc tta agc aat tcc att tat gtg ttt gtt aga 336 Cys Thr Asn Lys His Gly Leu Ser Asn Ser Ile Tyr Val Phe Val Arg 100 105 110 gat cct gcc aag ctt ttc ctt gtt gac cgc tcc ttg tat ggg aaa gaa 384 Asp Pro Ala Lys Leu Phe Leu Val Asp Arg Ser Leu Tyr Gly Lys Glu 115 120 125 gac aac gac acg ctg gtc cgc tgt cct ctc aca gac cca gaa gtg acc 432 Asp Asn Asp Thr Leu Val Arg Cys Pro Leu Thr Asp Pro Glu Val Thr 130 135 140 aat tat tcc ctc aag ggg tgc cag ggg aag cct ctt ccc aag gac ttg 480 Asn Tyr Ser Leu Lys Gly Cys Gln Gly Lys Pro Leu Pro Lys Asp Leu 145 150 155 160 agg ttt att cct gac ccc aag gcg ggc atc atg atc aaa agt gtg aaa 528 Arg Phe Ile Pro Asp Pro Lys Ala Gly Ile Met Ile Lys Ser Val Lys 165 170 175 cgc gcc tac cat cgg ctc tgt ctg cat tgt tct gtg gac cag gag ggc 576 Arg Ala Tyr His Arg Leu Cys Leu His Cys Ser Val Asp Gln Glu Gly 180 185 190 aag tca gtg ctg tcg gaa aaa ttc atc ctg aaa gtg agg cca gcc ttc 624 Lys Ser Val Leu Ser Glu Lys Phe Ile Leu Lys Val Arg Pro Ala Phe 195 200 205 aaa gct gtg cct gtt gtg tct gtg tcc aaa gca agc tat ctt ctt agg 672 Lys Ala Val Pro Val Val Ser Val Ser Lys Ala Ser Tyr Leu Leu Arg 210 215 220 gaa ggg gaa gaa ttc aca gtg acg tgc aca ata aaa gat gtg tct agt 720 Glu Gly Glu Glu Phe Thr Val Thr Cys Thr Ile Lys Asp Val Ser Ser 225 230 235 240 tct gtg tac tca acg tgg aaa aga gaa aac agt cag act aaa cta cag 768 Ser Val Tyr Ser Thr Trp Lys Arg Glu Asn Ser Gln Thr Lys Leu Gln 245 250 255 gag aaa tat aat agc tgg cat cac ggt gac ttc aat tat gaa cgt cag 816 Glu Lys Tyr Asn Ser Trp His His Gly Asp Phe Asn Tyr Glu Arg Gln 260 265 270 gca acg ttg act atc agt tca gcg aga gtt aat gat tct gga gtg ttc 864 Ala Thr Leu Thr Ile Ser Ser Ala Arg Val Asn Asp Ser Gly Val Phe 275 280 285 atg tgt tat gcc aat aat act ttt gga tca gca aat gtc aca aca acc 912 Met Cys Tyr Ala Asn Asn Thr Phe Gly Ser Ala Asn Val Thr Thr Thr 290 295 300 ttg gaa gta gta gat aaa gga ttc att aat atc ttc ccc atg ata aac 960 Leu Glu Val Val Asp Lys Gly Phe Ile Asn Ile Phe Pro Met Ile Asn 305 310 315 320 act aca gta ttt gta aac gat gga gaa aat gta gat ttg att gtt gaa 1008 Thr Thr Val Phe Val Asn Asp Gly Glu Asn Val Asp Leu Ile Val Glu 325 330 335 tat gaa gca ttc ccc aaa cct gaa cac cag cag tgg atc tat atg aac 1056 Tyr Glu Ala Phe Pro Lys Pro Glu His Gln Gln Trp Ile Tyr Met Asn 340 345 350 aga acc ttc act gat aaa tgg gaa gat tat ccc aag tct gag aat gaa 1104 Arg Thr Phe Thr Asp Lys Trp Glu Asp Tyr Pro Lys Ser Glu Asn Glu 355 360 365 agt aat atc aga tac gta agt gaa ctt cat cta acg aga tta aaa ggc 1152 Ser Asn Ile Arg Tyr Val Ser Glu Leu His Leu Thr Arg Leu Lys Gly 370 375 380 acc gaa gga ggc act tac aca ttc cta gtg tcc aat tct gac gtc aat 1200 Thr Glu Gly Gly Thr Tyr Thr Phe Leu Val Ser Asn Ser Asp Val Asn 385 390 395 400 gct gcc ata gca ttt aat gtt tat gtg aat aca aaa cca gaa atc ctg 1248 Ala Ala Ile Ala Phe Asn Val Tyr Val Asn Thr Lys Pro Glu Ile Leu 405 410 415 act tac gac agg ctc gtg aat ggc atg ctc caa tgt gtg gca gca gga 1296 Thr Tyr Asp Arg Leu Val Asn Gly Met Leu Gln Cys Val Ala Ala Gly 420 425 430 ttc cca gag ccc aca ata gat tgg tat ttt tgt cca gga act gag cag 1344 Phe Pro Glu Pro Thr Ile Asp Trp Tyr Phe Cys Pro Gly Thr Glu Gln 435 440 445 aga tgc tct gct tct gta ctg cca gtg gat gtg cag aca cta aac tca 1392 Arg Cys Ser Ala Ser Val Leu Pro Val Asp Val Gln Thr Leu Asn Ser 450 455 460 tct ggg cca ccg ttt gga aag cta gtg gtt cag agt tct ata gat tct 1440 Ser Gly Pro Pro Phe Gly Lys Leu Val Val Gln Ser Ser Ile Asp Ser 465 470 475 480 agt gca ttc aag cac aat ggc acg gtt gaa tgt aag gct tac aac gat 1488 Ser Ala Phe Lys His Asn Gly Thr Val Glu Cys Lys Ala Tyr Asn Asp 485 490 495 gtg ggc aag act tct gcc tat ttt aac ttt gca ttt aaa ggt aac aac 1536 Val Gly Lys Thr Ser Ala Tyr Phe Asn Phe Ala Phe Lys Gly Asn Asn 500 505 510 aaa gag caa atc cat ccc cac acc ctg ttc act cct ttg ctg att ggt 1584 Lys Glu Gln Ile His Pro His Thr Leu Phe Thr Pro Leu Leu Ile Gly 515 520 525 ttc gta atc gta gct ggc atg atg tgc att att gtg atg att ctg acc 1632 Phe Val Ile Val Ala Gly Met Met Cys Ile Ile Val Met Ile Leu Thr 530 535 540 tac aaa tat tta cag aaa ccc atg tat gaa gta cag tgg aag gtt gtt 1680 Tyr Lys Tyr Leu Gln Lys Pro Met Tyr Glu Val Gln Trp Lys Val Val 545 550 555 560 gag gag ata aat gga aac aat tat gtt tac ata gac cca aca caa ctt 1728 Glu Glu Ile Asn Gly Asn Asn Tyr Val Tyr Ile Asp Pro Thr Gln Leu 565 570 575 cct tat gat cac aaa tgg gag ttt ccc aga aac agg ctg agt ttt ggg 1776 Pro Tyr Asp His Lys Trp Glu Phe Pro Arg Asn Arg Leu Ser Phe Gly 580 585 590 aaa acc ctg ggt gct gga gct ttc ggg aag gtt gtt gag gca act gct 1824 Lys Thr Leu Gly Ala Gly Ala Phe Gly Lys Val Val Glu Ala Thr Ala 595 600 605 tat ggc tta att aag tca gat gcg gcc atg act gtc gct gta aag atg 1872 Tyr Gly Leu Ile Lys Ser Asp Ala Ala Met Thr Val Ala Val Lys Met 610 615 620 ctc aag ccg agt gcc cat ttg aca gaa cgg gaa gcc ctc atg tct gaa 1920 Leu Lys Pro Ser Ala His Leu Thr Glu Arg Glu Ala Leu Met Ser Glu 625 630 635 640 ctc aaa gtc ctg agt tac ctt ggt aat cac atg aat att gtg aat cta 1968 Leu Lys Val Leu Ser Tyr Leu Gly Asn His Met Asn Ile Val Asn Leu 645 650 655 ctt gga gcc tgc acc att gga ggg ccc acc ctg gtc att aca gaa tat 2016 Leu Gly Ala Cys Thr Ile Gly Gly Pro Thr Leu Val Ile Thr Glu Tyr 660 665 670 tgt tgc tat ggt gat ctt ttg aat ttt ttg aga aga aaa cgt gat tca 2064 Cys Cys Tyr Gly Asp Leu Leu Asn Phe Leu Arg Arg Lys Arg Asp Ser 675 680 685 ttt att tgt tca aag cag gaa gat cat gca gaa gct gca ctt tat aag 2112 Phe Ile Cys Ser Lys Gln Glu Asp His Ala Glu Ala Ala Leu Tyr Lys 690 695 700 aat ctt ctg cat tca aag gag tct tcc tgc agc gat agt act aat gag 2160 Asn Leu Leu His Ser Lys Glu Ser Ser Cys Ser Asp Ser Thr Asn Glu 705 710 715 720 tac atg gac atg aaa cct gga gtt tct tat gtt gtc cca acc aag gcc 2208 Tyr Met Asp Met Lys Pro Gly Val Ser Tyr Val Val Pro Thr Lys Ala 725 730 735 gac aaa agg aga tct gtg aga ata ggc tca tac ata gaa aga gat gtg 2256 Asp Lys Arg Arg Ser Val Arg Ile Gly Ser Tyr Ile Glu Arg Asp Val 740 745 750 act ccc gcc atc atg gag gat gac gag ttg gcc cta gac tta gaa gac 2304 Thr Pro Ala Ile Met Glu Asp Asp Glu Leu Ala Leu Asp Leu Glu Asp 755 760 765 ttg ctg agc ttt tct tac cag gtg gca aag ggc atg gct ttc ctc gcc 2352 Leu Leu Ser Phe Ser Tyr Gln Val Ala Lys Gly Met Ala Phe Leu Ala 770 775 780 tcc aag aat tgt att cac aga gac ttg gca gcc aga aat atc ctc ctt 2400 Ser Lys Asn Cys Ile His Arg Asp Leu Ala Ala Arg Asn Ile Leu Leu 785 790 795 800 act cat ggt cgg atc aca aag att tgt gat ttt ggt cta gcc aga gac 2448 Thr His Gly Arg Ile Thr Lys Ile Cys Asp Phe Gly Leu Ala Arg Asp 805 810 815 atc aag aat gat tct aat tat gtg gtt aaa gga aac gct cga cta cct 2496 Ile Lys Asn Asp Ser Asn Tyr Val Val Lys Gly Asn Ala Arg Leu Pro 820 825 830 gtg aag tgg atg gca cct gaa agc att ttc aac tgt gta tac acg ttt 2544 Val Lys Trp Met Ala Pro Glu Ser Ile Phe Asn Cys Val Tyr Thr Phe 835 840 845 gaa agt gac gtc tgg tcc tat ggg att ttt ctt tgg gag ctg ttc tct 2592 Glu Ser Asp Val Trp Ser Tyr Gly Ile Phe Leu Trp Glu Leu Phe Ser 850 855 860 tta gga agc agc ccc tat cct gga atg ccg gtc gat tct aag ttc tac 2640 Leu Gly Ser Ser Pro Tyr Pro Gly Met Pro Val Asp Ser Lys Phe Tyr 865 870 875 880 aag atg atc aag gaa ggc ttc cgg atg ctc agc cct gaa cac gca cct 2688 Lys Met Ile Lys Glu Gly Phe Arg Met Leu Ser Pro Glu His Ala Pro 885 890 895 gct gaa atg tat gac ata atg aag act tgc tgg gat gca gat ccc cta 2736 Ala Glu Met Tyr Asp Ile Met Lys Thr Cys Trp Asp Ala Asp Pro Leu 900 905 910 aaa aga cca aca ttc aag caa att gtt cag cta att gag aag cag att 2784 Lys Arg Pro Thr Phe Lys Gln Ile Val Gln Leu Ile Glu Lys Gln Ile 915 920 925 tca gag agc acc aat cat att tac tcc aac tta gca aac tgc agc ccc 2832 Ser Glu Ser Thr Asn His Ile Tyr Ser Asn Leu Ala Asn Cys Ser Pro 930 935 940 aac cga cag aag ccc gtg gta gac cat tct gtg cgg atc aat tct gtc 2880 Asn Arg Gln Lys Pro Val Val Asp His Ser Val Arg Ile Asn Ser Val 945 950 955 960 ggc agc acc gct tcc tcc tcc cag cct ctg ctt gtg cac gac gat gtc 2928 Gly Ser Thr Ala Ser Ser Ser Gln Pro Leu Leu Val His Asp Asp Val 965 970 975 tga 2931 2 976 PRT Homo sapiens 2 Met Arg Gly Ala Arg Gly Ala Trp Asp Phe Leu Cys Val Leu Leu Leu 1 5 10 15 Leu Leu Arg Val Gln Thr Gly Ser Ser Gln Pro Ser Val Ser Pro Gly 20 25 30 Glu Pro Ser Pro Pro Ser Ile His Pro Gly Lys Ser Asp Leu Ile Val 35 40 45 Arg Val Gly Asp Glu Ile Arg Leu Leu Cys Thr Asp Pro Gly Phe Val 50 55 60 Lys Trp Thr Phe Glu Ile Leu Asp Glu Thr Asn Glu Asn Lys Gln Asn 65 70 75 80 Glu Trp Ile Thr Glu Lys Ala Glu Ala Thr Asn Thr Gly Lys Tyr Thr 85 90 95 Cys Thr Asn Lys His Gly Leu Ser Asn Ser Ile Tyr Val Phe Val Arg 100 105 110 Asp Pro Ala Lys Leu Phe Leu Val Asp Arg Ser Leu Tyr Gly Lys Glu 115 120 125 Asp Asn Asp Thr Leu Val Arg Cys Pro Leu Thr Asp Pro Glu Val Thr 130 135 140 Asn Tyr Ser Leu Lys Gly Cys Gln Gly Lys Pro Leu Pro Lys Asp Leu 145 150 155 160 Arg Phe Ile Pro Asp Pro Lys Ala Gly Ile Met Ile Lys Ser Val Lys 165 170 175 Arg Ala Tyr His Arg Leu Cys Leu His Cys Ser Val Asp Gln Glu Gly 180 185 190 Lys Ser Val Leu Ser Glu Lys Phe Ile Leu Lys Val Arg Pro Ala Phe 195 200 205 Lys Ala Val Pro Val Val Ser Val Ser Lys Ala Ser Tyr Leu Leu Arg 210 215 220 Glu Gly Glu Glu Phe Thr Val Thr Cys Thr Ile Lys Asp Val Ser Ser 225 230 235 240 Ser Val Tyr Ser Thr Trp Lys Arg Glu Asn Ser Gln Thr Lys Leu Gln 245 250 255 Glu Lys Tyr Asn Ser Trp His His Gly Asp Phe Asn Tyr Glu Arg Gln 260 265 270 Ala Thr Leu Thr Ile Ser Ser Ala Arg Val Asn Asp Ser Gly Val Phe 275 280 285 Met Cys Tyr Ala Asn Asn Thr Phe Gly Ser Ala Asn Val Thr Thr Thr 290 295 300 Leu Glu Val Val Asp Lys Gly Phe Ile Asn Ile Phe Pro Met Ile Asn 305 310 315 320 Thr Thr Val Phe Val Asn Asp Gly Glu Asn Val Asp Leu Ile Val Glu 325 330 335 Tyr Glu Ala Phe Pro Lys Pro Glu His Gln Gln Trp Ile Tyr Met Asn 340 345 350 Arg Thr Phe Thr Asp Lys Trp Glu Asp Tyr Pro Lys Ser Glu Asn Glu 355 360 365 Ser Asn Ile Arg Tyr Val Ser Glu Leu His Leu Thr Arg Leu Lys Gly 370 375 380 Thr Glu Gly Gly Thr Tyr Thr Phe Leu Val Ser Asn Ser Asp Val Asn 385 390 395 400 Ala Ala Ile Ala Phe Asn Val Tyr Val Asn Thr Lys Pro Glu Ile Leu 405 410 415 Thr Tyr Asp Arg Leu Val Asn Gly Met Leu Gln Cys Val Ala Ala Gly 420 425 430 Phe Pro Glu Pro Thr Ile Asp Trp Tyr Phe Cys Pro Gly Thr Glu Gln 435 440 445 Arg Cys Ser Ala Ser Val Leu Pro Val Asp Val Gln Thr Leu Asn Ser 450 455 460 Ser Gly Pro Pro Phe Gly Lys Leu Val Val Gln Ser Ser Ile Asp Ser 465 470 475 480 Ser Ala Phe Lys His Asn Gly Thr Val Glu Cys Lys Ala Tyr Asn Asp 485 490 495 Val Gly Lys Thr Ser Ala Tyr Phe Asn Phe Ala Phe Lys Gly Asn Asn 500 505 510 Lys Glu Gln Ile His Pro His Thr Leu Phe Thr Pro Leu Leu Ile Gly 515 520 525 Phe Val Ile Val Ala Gly Met Met Cys Ile Ile Val Met Ile Leu Thr 530 535 540 Tyr Lys Tyr Leu Gln Lys Pro Met Tyr Glu Val Gln Trp Lys Val Val 545 550 555 560 Glu Glu Ile Asn Gly Asn Asn Tyr Val Tyr Ile Asp Pro Thr Gln Leu 565 570 575 Pro Tyr Asp His Lys Trp Glu Phe Pro Arg Asn Arg Leu Ser Phe Gly 580 585 590 Lys Thr Leu Gly Ala Gly Ala Phe Gly Lys Val Val Glu Ala Thr Ala 595 600 605 Tyr Gly Leu Ile Lys Ser Asp Ala Ala Met Thr Val Ala Val Lys Met 610 615 620 Leu Lys Pro Ser Ala His Leu Thr Glu Arg Glu Ala Leu Met Ser Glu 625 630 635 640 Leu Lys Val Leu Ser Tyr Leu Gly Asn His Met Asn Ile Val Asn Leu 645 650 655 Leu Gly Ala Cys Thr Ile Gly Gly Pro Thr Leu Val Ile Thr Glu Tyr 660 665 670 Cys Cys Tyr Gly Asp Leu Leu Asn Phe Leu Arg Arg Lys Arg Asp Ser 675 680 685 Phe Ile Cys Ser Lys Gln Glu Asp His Ala Glu Ala Ala Leu Tyr Lys 690 695 700 Asn Leu Leu His Ser Lys Glu Ser Ser Cys Ser Asp Ser Thr Asn Glu 705 710 715 720 Tyr Met Asp Met Lys Pro Gly Val Ser Tyr Val Val Pro Thr Lys Ala 725 730 735 Asp Lys Arg Arg Ser Val Arg Ile Gly Ser Tyr Ile Glu Arg Asp Val 740 745 750 Thr Pro Ala Ile Met Glu Asp Asp Glu Leu Ala Leu Asp Leu Glu Asp 755 760 765 Leu Leu Ser Phe Ser Tyr Gln Val Ala Lys Gly Met Ala Phe Leu Ala 770 775 780 Ser Lys Asn Cys Ile His Arg Asp Leu Ala Ala Arg Asn Ile Leu Leu 785 790 795 800 Thr His Gly Arg Ile Thr Lys Ile Cys Asp Phe Gly Leu Ala Arg Asp 805 810 815 Ile Lys Asn Asp Ser Asn Tyr Val Val Lys Gly Asn Ala Arg Leu Pro 820 825 830 Val Lys Trp Met Ala Pro Glu Ser Ile Phe Asn Cys Val Tyr Thr Phe 835 840 845 Glu Ser Asp Val Trp Ser Tyr Gly Ile Phe Leu Trp Glu Leu Phe Ser 850 855 860 Leu Gly Ser Ser Pro Tyr Pro Gly Met Pro Val Asp Ser Lys Phe Tyr 865 870 875 880 Lys Met Ile Lys Glu Gly Phe Arg Met Leu Ser Pro Glu His Ala Pro 885 890 895 Ala Glu Met Tyr Asp Ile Met Lys Thr Cys Trp Asp Ala Asp Pro Leu 900 905 910 Lys Arg Pro Thr Phe Lys Gln Ile Val Gln Leu Ile Glu Lys Gln Ile 915 920 925 Ser Glu Ser Thr Asn His Ile Tyr Ser Asn Leu Ala Asn Cys Ser Pro 930 935 940 Asn Arg Gln Lys Pro Val Val Asp His Ser Val Arg Ile Asn Ser Val 945 950 955 960 Gly Ser Thr Ala Ser Ser Ser Gln Pro Leu Leu Val His Asp Asp Val 965 970 975 3 1947 DNA Homo sapiens CDS (1)..(1944) misc_structure (1081)..(1083) codon 361 that can mutate to encode Serine 3 atg gag cac ata cag gga gct tgg aag acg atc agc aat ggt ttt gga 48 Met Glu His Ile Gln Gly Ala Trp Lys Thr Ile Ser Asn Gly Phe Gly 1 5 10 15 ttc aaa gat gcc gtg ttt gat ggc tcc agc tgc atc tct cct aca ata 96 Phe Lys Asp Ala Val Phe Asp Gly Ser Ser Cys Ile Ser Pro Thr Ile 20 25 30 gtt cag cag ttt ggc tat cag cgc cgg gca tca gat gat ggc aaa ctc 144 Val Gln Gln Phe Gly Tyr Gln Arg Arg Ala Ser Asp Asp Gly Lys Leu 35 40 45 aca gat cct tct aag aca agc aac act atc cgt gtt ttc ttg ccg aac 192 Thr Asp Pro Ser Lys Thr Ser Asn Thr Ile Arg Val Phe Leu Pro Asn 50 55 60 aag caa aga aca gtg gtc aat gtg cga aat gga atg agc ttg cat gac 240 Lys Gln Arg Thr Val Val Asn Val Arg Asn Gly Met Ser Leu His Asp 65 70 75 80 tgc ctt atg aaa gca ctc aag gtg agg ggc ctg caa cca gag tgc tgt 288 Cys Leu Met Lys Ala Leu Lys Val Arg Gly Leu Gln Pro Glu Cys Cys 85 90 95 gca gtg ttc aga ctt ctc cac gaa cac aaa ggt aaa aaa gca cgc tta 336 Ala Val Phe Arg Leu Leu His Glu His Lys Gly Lys Lys Ala Arg Leu 100 105 110 gat tgg aat act gat gct gcg tct ttg att gga gaa gaa ctt caa gta 384 Asp Trp Asn Thr Asp Ala Ala Ser Leu Ile Gly Glu Glu Leu Gln Val 115 120 125 gat ttc ctg gat cat gtt ccc ctc aca aca cac aac ttt gct cgg aag 432 Asp Phe Leu Asp His Val Pro Leu Thr Thr His Asn Phe Ala Arg Lys 130 135 140 acg ttc ctg aag ctt gcc ttc tgt gac atc tgt cag aaa ttc ctg ctc 480 Thr Phe Leu Lys Leu Ala Phe Cys Asp Ile Cys Gln Lys Phe Leu Leu 145 150 155 160 aat gga ttt cga tgt cag act tgt ggc tac aaa ttt cat gag cac tgt 528 Asn Gly Phe Arg Cys Gln Thr Cys Gly Tyr Lys Phe His Glu His Cys 165 170 175 agc acc aaa gta cct act atg tgt gtg gac tgg agt aac atc aga caa 576 Ser Thr Lys Val Pro Thr Met Cys Val Asp Trp Ser Asn Ile Arg Gln 180 185 190 ctc tta ttg ttt cca aat tcc act att ggt gat agt gga gtc cca gca 624 Leu Leu Leu Phe Pro Asn Ser Thr Ile Gly Asp Ser Gly Val Pro Ala 195 200 205 cta cct tct ttg act atg cgt cgt atg cga gag tct gtt tcc agg atg 672 Leu Pro Ser Leu Thr Met Arg Arg Met Arg Glu Ser Val Ser Arg Met 210 215 220 cct gtt agt tct cag cac aga tat tct aca cct cac gcc ttc acc ttt 720 Pro Val Ser Ser Gln His Arg Tyr Ser Thr Pro His Ala Phe Thr Phe 225 230 235 240 aac acc tcc agt ccc tca tct gaa ggt tcc ctc tcc cag agg cag agg 768 Asn Thr Ser Ser Pro Ser Ser Glu Gly Ser Leu Ser Gln Arg Gln Arg 245 250 255 tcg aca tcc aca cct aat gtc cac atg gtc agc acc acg ctg cct gtg 816 Ser Thr Ser Thr Pro Asn Val His Met Val Ser Thr Thr Leu Pro Val 260 265 270 gac agc agg atg att gag gat gca att cga agt cac agc gaa tca gcc 864 Asp Ser Arg Met Ile Glu Asp Ala Ile Arg Ser His Ser Glu Ser Ala 275 280 285 tca cct tca gcc ctg tcc agt agc ccc aac aat ctg agc cca aca ggc 912 Ser Pro Ser Ala Leu Ser Ser Ser Pro Asn Asn Leu Ser Pro Thr Gly 290 295 300 tgg tca cag ccg aaa acc ccc gtg cca gca caa aga gag cgg gca cca 960 Trp Ser Gln Pro Lys Thr Pro Val Pro Ala Gln Arg Glu Arg Ala Pro 305 310 315 320 gta tct ggg acc cag gag aaa aac aaa att agg cct cgt gga cag aga 1008 Val Ser Gly Thr Gln Glu Lys Asn Lys Ile Arg Pro Arg Gly Gln Arg 325 330 335 gat tca agc tat tat tgg gaa ata gaa gcc agt gaa gtg atg ctg tcc 1056 Asp Ser Ser Tyr Tyr Trp Glu Ile Glu Ala Ser Glu Val Met Leu Ser 340 345 350 act cgg att ggg tca ggc tct ttt gga act gtt tat aag ggt aaa tgg 1104 Thr Arg Ile Gly Ser Gly Ser Phe Gly Thr Val Tyr Lys Gly Lys Trp 355 360 365 cac gga gat gtt gca gta aag atc cta aag gtt gtc gac cca acc cca 1152 His Gly Asp Val Ala Val Lys Ile Leu Lys Val Val Asp Pro Thr Pro 370 375 380 gag caa ttc cag gcc ttc agg aat gag gtg gct gtt ctg cgc aaa aca 1200 Glu Gln Phe Gln Ala Phe Arg Asn Glu Val Ala Val Leu Arg Lys Thr 385 390 395 400 cgg cat gtg aac att ctg ctt ttc atg ggg tac atg aca aag gac aac 1248 Arg His Val Asn Ile Leu Leu Phe Met Gly Tyr Met Thr Lys Asp Asn 405 410 415 ctg gca att gtg acc cag tgg tgc gag ggc agc agc ctc tac aaa cac 1296 Leu Ala Ile Val Thr Gln Trp Cys Glu Gly Ser Ser Leu Tyr Lys His 420 425 430 ctg cat gtc cag gag acc aag ttt cag atg ttc cag cta att gac att 1344 Leu His Val Gln Glu Thr Lys Phe Gln Met Phe Gln Leu Ile Asp Ile 435 440 445 gcc cgg cag acg gct cag gga atg gac tat ttg cat gca aag aac atc 1392 Ala Arg Gln Thr Ala Gln Gly Met Asp Tyr Leu His Ala Lys Asn Ile 450 455 460 atc cat aga gac atg aaa tcc aac aat ata ttt ctc cat gaa ggc tta 1440 Ile His Arg Asp Met Lys Ser Asn Asn Ile Phe Leu His Glu Gly Leu 465 470 475 480 aca gtg aaa att gga gat ttt ggt ttg gca aca gta aag tca cgc tgg 1488 Thr Val Lys Ile Gly Asp Phe Gly Leu Ala Thr Val Lys Ser Arg Trp 485 490 495 agt ggt tct cag cag gtt gaa caa cct act ggc tct gtc ctc tgg atg 1536 Ser Gly Ser Gln Gln Val Glu Gln Pro Thr Gly Ser Val Leu Trp Met 500 505 510 gcc cca gag gtg atc cga atg cag gat aac aac cca ttc agt ttc cag 1584 Ala Pro Glu Val Ile Arg Met Gln Asp Asn Asn Pro Phe Ser Phe Gln 515 520 525 tcg gat gtc tac tcc tat ggc atc gta ttg tat gaa ctg atg acg ggg 1632 Ser Asp Val Tyr Ser Tyr Gly Ile Val Leu Tyr Glu Leu Met Thr Gly 530 535 540 gag ctt cct tat tct cac atc aac aac cga gat cag atc atc ttc atg 1680 Glu Leu Pro Tyr Ser His Ile Asn Asn Arg Asp Gln Ile Ile Phe Met 545 550 555 560 gtg ggc cga gga tat gcc tcc cca gat ctt agt aag cta tat aag aac 1728 Val Gly Arg Gly Tyr Ala Ser Pro Asp Leu Ser Lys Leu Tyr Lys Asn 565 570 575 tgc ccc aaa gca atg aag agg ctg gta gct gac tgt gtg aag aaa gta 1776 Cys Pro Lys Ala Met Lys Arg Leu Val Ala Asp Cys Val Lys Lys Val 580 585 590 aag gaa gag agg cct ctt ttt ccc cag atc ctg tct tcc att gag ctg 1824 Lys Glu Glu Arg Pro Leu Phe Pro Gln Ile Leu Ser Ser Ile Glu Leu 595 600 605 ctc caa cac tct cta ccg aag atc aac cgg agc gct tcc gag cca tcc 1872 Leu Gln His Ser Leu Pro Lys Ile Asn Arg Ser Ala Ser Glu Pro Ser 610 615 620 ttg cat cgg gca gcc cac act gag gat atc aat gct tgc acg ctg acc 1920 Leu His Arg Ala Ala His Thr Glu Asp Ile Asn Ala Cys Thr Leu Thr 625 630 635 640 acg tcc ccg agg ctg cct gtc ttc tag 1947 Thr Ser Pro Arg Leu Pro Val Phe 645 4 648 PRT Homo sapiens 4 Met Glu His Ile Gln Gly Ala Trp Lys Thr Ile Ser Asn Gly Phe Gly 1 5 10 15 Phe Lys Asp Ala Val Phe Asp Gly Ser Ser Cys Ile Ser Pro Thr Ile 20 25 30 Val Gln Gln Phe Gly Tyr Gln Arg Arg Ala Ser Asp Asp Gly Lys Leu 35 40 45 Thr Asp Pro Ser Lys Thr Ser Asn Thr Ile Arg Val Phe Leu Pro Asn 50 55 60 Lys Gln Arg Thr Val Val Asn Val Arg Asn Gly Met Ser Leu His Asp 65 70 75 80 Cys Leu Met Lys Ala Leu Lys Val Arg Gly Leu Gln Pro Glu Cys Cys 85 90 95 Ala Val Phe Arg Leu Leu His Glu His Lys Gly Lys Lys Ala Arg Leu 100 105 110 Asp Trp Asn Thr Asp Ala Ala Ser Leu Ile Gly Glu Glu Leu Gln Val 115 120 125 Asp Phe Leu Asp His Val Pro Leu Thr Thr His Asn Phe Ala Arg Lys 130 135 140 Thr Phe Leu Lys Leu Ala Phe Cys Asp Ile Cys Gln Lys Phe Leu Leu 145 150 155 160 Asn Gly Phe Arg Cys Gln Thr Cys Gly Tyr Lys Phe His Glu His Cys 165 170 175 Ser Thr Lys Val Pro Thr Met Cys Val Asp Trp Ser Asn Ile Arg Gln 180 185 190 Leu Leu Leu Phe Pro Asn Ser Thr Ile Gly Asp Ser Gly Val Pro Ala 195 200 205 Leu Pro Ser Leu Thr Met Arg Arg Met Arg Glu Ser Val Ser Arg Met 210 215 220 Pro Val Ser Ser Gln His Arg Tyr Ser Thr Pro His Ala Phe Thr Phe 225 230 235 240 Asn Thr Ser Ser Pro Ser Ser Glu Gly Ser Leu Ser Gln Arg Gln Arg 245 250 255 Ser Thr Ser Thr Pro Asn Val His Met Val Ser Thr Thr Leu Pro Val 260 265 270 Asp Ser Arg Met Ile Glu Asp Ala Ile Arg Ser His Ser Glu Ser Ala 275 280 285 Ser Pro Ser Ala Leu Ser Ser Ser Pro Asn Asn Leu Ser Pro Thr Gly 290 295 300 Trp Ser Gln Pro Lys Thr Pro Val Pro Ala Gln Arg Glu Arg Ala Pro 305 310 315 320 Val Ser Gly Thr Gln Glu Lys Asn Lys Ile Arg Pro Arg Gly Gln Arg 325 330 335 Asp Ser Ser Tyr Tyr Trp Glu Ile Glu Ala Ser Glu Val Met Leu Ser 340 345 350 Thr Arg Ile Gly Ser Gly Ser Phe Gly Thr Val Tyr Lys Gly Lys Trp 355 360 365 His Gly Asp Val Ala Val Lys Ile Leu Lys Val Val Asp Pro Thr Pro 370 375 380 Glu Gln Phe Gln Ala Phe Arg Asn Glu Val Ala Val Leu Arg Lys Thr 385 390 395 400 Arg His Val Asn Ile Leu Leu Phe Met Gly Tyr Met Thr Lys Asp Asn 405 410 415 Leu Ala Ile Val Thr Gln Trp Cys Glu Gly Ser Ser Leu Tyr Lys His 420 425 430 Leu His Val Gln Glu Thr Lys Phe Gln Met Phe Gln Leu Ile Asp Ile 435 440 445 Ala Arg Gln Thr Ala Gln Gly Met Asp Tyr Leu His Ala Lys Asn Ile 450 455 460 Ile His Arg Asp Met Lys Ser Asn Asn Ile Phe Leu His Glu Gly Leu 465 470 475 480 Thr Val Lys Ile Gly Asp Phe Gly Leu Ala Thr Val Lys Ser Arg Trp 485 490 495 Ser Gly Ser Gln Gln Val Glu Gln Pro Thr Gly Ser Val Leu Trp Met 500 505 510 Ala Pro Glu Val Ile Arg Met Gln Asp Asn Asn Pro Phe Ser Phe Gln 515 520 525 Ser Asp Val Tyr Ser Tyr Gly Ile Val Leu Tyr Glu Leu Met Thr Gly 530 535 540 Glu Leu Pro Tyr Ser His Ile Asn Asn Arg Asp Gln Ile Ile Phe Met 545 550 555 560 Val Gly Arg Gly Tyr Ala Ser Pro Asp Leu Ser Lys Leu Tyr Lys Asn 565 570 575 Cys Pro Lys Ala Met Lys Arg Leu Val Ala Asp Cys Val Lys Lys Val 580 585 590 Lys Glu Glu Arg Pro Leu Phe Pro Gln Ile Leu Ser Ser Ile Glu Leu 595 600 605 Leu Gln His Ser Leu Pro Lys Ile Asn Arg Ser Ala Ser Glu Pro Ser 610 615 620 Leu His Arg Ala Ala His Thr Glu Asp Ile Asn Ala Cys Thr Leu Thr 625 630 635 640 Thr Ser Pro Arg Leu Pro Val Phe 645 5 3450 DNA Homo sapiens CDS (1)..(3447) misc_structure (730)..(765) encode amino acids 244-255 for forming the sides of the enzymatic pocket 5 atg ggg cag cag cct gga aaa gta ctt ggg gac caa aga agg cca agc 48 Met Gly Gln Gln Pro Gly Lys Val Leu Gly Asp Gln Arg Arg Pro Ser 1 5 10 15 ttg cct gcc ctg cat ttt atc aaa gga gca ggg aag aag gaa tca tcg 96 Leu Pro Ala Leu His Phe Ile Lys Gly Ala Gly Lys Lys Glu Ser Ser 20 25 30 agg cat ggg ggt cca cac tgc aat gtt ttt gtg gaa cat gaa gcc ctt 144 Arg His Gly Gly Pro His Cys Asn Val Phe Val Glu His Glu Ala Leu 35 40 45 cag cgg cca gta gca tct gac ttt gag cct cag ggt ctg agt gaa gcc 192 Gln Arg Pro Val Ala Ser Asp Phe Glu Pro Gln Gly Leu Ser Glu Ala 50 55 60 gct cgt tgg aac tcc aag gaa aac ctt ctc gct gga ccc agt gaa aat 240 Ala Arg Trp Asn Ser Lys Glu Asn Leu Leu Ala Gly Pro Ser Glu Asn 65 70 75 80 gac ccc aac ctt ttc gtt gca ctg tat gat ttt gtg gcc agt gga gat 288 Asp Pro Asn Leu Phe Val Ala Leu Tyr Asp Phe Val Ala Ser Gly Asp 85 90 95 aac act cta agc ata act aaa ggt gaa aag ctc cgg gtc tta ggc tat 336 Asn Thr Leu Ser Ile Thr Lys Gly Glu Lys Leu Arg Val Leu Gly Tyr 100 105 110 aat cac aat ggg gaa tgg tgt gaa gcc caa acc aaa aat ggc caa ggc 384 Asn His Asn Gly Glu Trp Cys Glu Ala Gln Thr Lys Asn Gly Gln Gly 115 120 125 tgg gtc cca agc aac tac atc acg cca gtc aac agt ctg gag aaa cac 432 Trp Val Pro Ser Asn Tyr Ile Thr Pro Val Asn Ser Leu Glu Lys His 130 135 140 tcc tgg tac cat ggg cct gtg tcc cgc aat gcc gct gag tat ctg ctg 480 Ser Trp Tyr His Gly Pro Val Ser Arg Asn Ala Ala Glu Tyr Leu Leu 145 150 155 160 agc agc ggg atc aat ggc agc ttc ttg gtg cgt gag agt gag agc agt 528 Ser Ser Gly Ile Asn Gly Ser Phe Leu Val Arg Glu Ser Glu Ser Ser 165 170 175 cct ggc cag agg tcc atc tcg ctg aga tac gaa ggg agg gtg tac cat 576 Pro Gly Gln Arg Ser Ile Ser Leu Arg Tyr Glu Gly Arg Val Tyr His 180 185 190 tac agg atc aac act gct tct gat ggc aag ctc tac gtc tcc tcc gag 624 Tyr Arg Ile Asn Thr Ala Ser Asp Gly Lys Leu Tyr Val Ser Ser Glu 195 200 205 agc cgc ttc aac acc ctg gcc gag ttg gtt cat cat cat tca acg gtg 672 Ser Arg Phe Asn Thr Leu Ala Glu Leu Val His His His Ser Thr Val 210 215 220 gcc gac ggg ctc atc acc acg ctc cat tat cca gcc cca aag cgc aac 720 Ala Asp Gly Leu Ile Thr Thr Leu His Tyr Pro Ala Pro Lys Arg Asn 225 230 235 240 aag ccc act gtc tat ggt gtg tcc ccc aac tac gac aag tgg gag atg 768 Lys Pro Thr Val Tyr Gly Val Ser Pro Asn Tyr Asp Lys Trp Glu Met 245 250 255 gaa cgc acg gac atc acc atg aag cac aag ctg ggc ggg ggc cag tac 816 Glu Arg Thr Asp Ile Thr Met Lys His Lys Leu Gly Gly Gly Gln Tyr 260 265 270 ggg gag gtg tac gag ggc gtg tgg aag aaa tac agc ctg acg gtg gcc 864 Gly Glu Val Tyr Glu Gly Val Trp Lys Lys Tyr Ser Leu Thr Val Ala 275 280 285 gtg aag acc ttg aag gag gac acc atg gag gtg gaa gag ttc ttg aaa 912 Val Lys Thr Leu Lys Glu Asp Thr Met Glu Val Glu Glu Phe Leu Lys 290 295 300 gaa gct gca gtc atg aaa gag atc aaa cac cct aac ctg gtg cag ctc 960 Glu Ala Ala Val Met Lys Glu Ile Lys His Pro Asn Leu Val Gln Leu 305 310 315 320 ctt ggg gtc tgc acc cgg gag ccc ccg ttc tat atc atc act gag ttc 1008 Leu Gly Val Cys Thr Arg Glu Pro Pro Phe Tyr Ile Ile Thr Glu Phe 325 330 335 atg acc tac ggg aac ctc ctg gac tac ctg agg gag tgc aac cgg cag 1056 Met Thr Tyr Gly Asn Leu Leu Asp Tyr Leu Arg Glu Cys Asn Arg Gln 340 345 350 gag gtg aac gcc gtg gtg ctg ctg tac atg gcc act cag atc tcg tca 1104 Glu Val Asn Ala Val Val Leu Leu Tyr Met Ala Thr Gln Ile Ser Ser 355 360 365 gcc atg gag tac ctg gag aag aaa aac ttc atc cac aga gat ctt gct 1152 Ala Met Glu Tyr Leu Glu Lys Lys Asn Phe Ile His Arg Asp Leu Ala 370 375 380 gcc cga aac tgc ctg gta ggg gag aac cac ttg gtg aag gta gct gat 1200 Ala Arg Asn Cys Leu Val Gly Glu Asn His Leu Val Lys Val Ala Asp 385 390 395 400 ttt ggc ctg agc agg ttg atg aca ggg gac acc tac aca gcc cat gct 1248 Phe Gly Leu Ser Arg Leu Met Thr Gly Asp Thr Tyr Thr Ala His Ala 405 410 415 gga gcc aag ttc ccc atc aaa tgg act gca ccc gag agc ctg gcc tac 1296 Gly Ala Lys Phe Pro Ile Lys Trp Thr Ala Pro Glu Ser Leu Ala Tyr 420 425 430 aac aag ttc tcc atc aag tcc gac gtc tgg gca ttt gga gta ttg ctt 1344 Asn Lys Phe Ser Ile Lys Ser Asp Val Trp Ala Phe Gly Val Leu Leu 435 440 445 tgg gaa att gct acc tat ggc atg tcc cct tac ccg gga att gac ctg 1392 Trp Glu Ile Ala Thr Tyr Gly Met Ser Pro Tyr Pro Gly Ile Asp Leu 450 455 460 tcc cag gtg tat gag ctg cta gag aag gac tac cgc atg gag cgc cca 1440 Ser Gln Val Tyr Glu Leu Leu Glu Lys Asp Tyr Arg Met Glu Arg Pro 465 470 475 480 gaa ggc tgc cca gag aag gtc tat gaa ctc atg cga gca tgt tgg cag 1488 Glu Gly Cys Pro Glu Lys Val Tyr Glu Leu Met Arg Ala Cys Trp Gln 485 490 495 tgg aat ccc tct gac cgg ccc tcc ttt gct gaa atc cac caa gcc ttt 1536 Trp Asn Pro Ser Asp Arg Pro Ser Phe Ala Glu Ile His Gln Ala Phe 500 505 510 gaa aca atg ttc cag gaa tcc agt atc tca gac gaa gtg gaa aag gag 1584 Glu Thr Met Phe Gln Glu Ser Ser Ile Ser Asp Glu Val Glu Lys Glu 515 520 525 ctg ggg aaa caa ggc gtc cgt ggg gct gtg agt acc ttg ctg cag gcc 1632 Leu Gly Lys Gln Gly Val Arg Gly Ala Val Ser Thr Leu Leu Gln Ala 530 535 540 cca gag ctg ccc acc aag acg agg acc tcc agg aga gct gca gag cac 1680 Pro Glu Leu Pro Thr Lys Thr Arg Thr Ser Arg Arg Ala Ala Glu His 545 550 555 560 aga gac acc act gac gtg cct gag atg cct cac tcc aag ggc cag gga 1728 Arg Asp Thr Thr Asp Val Pro Glu Met Pro His Ser Lys Gly Gln Gly 565 570 575 gag agc gat cct ctg gac cat gag cct gcc gtg tct cca ttg ctc cct 1776 Glu Ser Asp Pro Leu Asp His Glu Pro Ala Val Ser Pro Leu Leu Pro 580 585 590 cga aaa gag cga ggt ccc ccg gag ggc ggc ctg aat gaa gat gag cgc 1824 Arg Lys Glu Arg Gly Pro Pro Glu Gly Gly Leu Asn Glu Asp Glu Arg 595 600 605 ctt ctc ccc aaa gac aaa aag acc aac ttg ttc agc gcc ttg atc aag 1872 Leu Leu Pro Lys Asp Lys Lys Thr Asn Leu Phe Ser Ala Leu Ile Lys 610 615 620 aag aag aag aag aca gcc cca acc cct ccc aaa cgc agc agc tcc ttc 1920 Lys Lys Lys Lys Thr Ala Pro Thr Pro Pro Lys Arg Ser Ser Ser Phe 625 630 635 640 cgg gag atg gac ggc cag ccg gag cgc aga ggg gcc ggc gag gaa gag 1968 Arg Glu Met Asp Gly Gln Pro Glu Arg Arg Gly Ala Gly Glu Glu Glu 645 650 655 ggc cga gac atc agc aac ggg gca ctg gct ttc acc ccc ttg gac aca 2016 Gly Arg Asp Ile Ser Asn Gly Ala Leu Ala Phe Thr Pro Leu Asp Thr 660 665 670 gct gac cca gcc aag tcc cca aag ccc agc aat ggg gct ggg gtc ccc 2064 Ala Asp Pro Ala Lys Ser Pro Lys Pro Ser Asn Gly Ala Gly Val Pro 675 680 685 aat gga gcc ctc cgg gag tcc ggg ggc tca ggc ttc cgg tct ccc cac 2112 Asn Gly Ala Leu Arg Glu Ser Gly Gly Ser Gly Phe Arg Ser Pro His 690 695 700 ctg tgg aag aag tcc agc acg ctg acc agc agc cgc cta gcc acc ggc 2160 Leu Trp Lys Lys Ser Ser Thr Leu Thr Ser Ser Arg Leu Ala Thr Gly 705 710 715 720 gag gag gag ggc ggt ggc agc tcc agc aag cgc ttc ctg cgc tct tgc 2208 Glu Glu Glu Gly Gly Gly Ser Ser Ser Lys Arg Phe Leu Arg Ser Cys 725 730 735 tcc gcc tcc tgc gtt ccc cat ggg gcc aag gac acg gag tgg agg tca 2256 Ser Ala Ser Cys Val Pro His Gly Ala Lys Asp Thr Glu Trp Arg Ser 740 745 750 gtc acg ctg cct cgg gac ttg cag tcc acg gga aga cag ttt gac tcg 2304 Val Thr Leu Pro Arg Asp Leu Gln Ser Thr Gly Arg Gln Phe Asp Ser 755 760 765 tcc aca ttt gga ggg cac aaa agt gag aag ccg gct ctg cct cgg aag 2352 Ser Thr Phe Gly Gly His Lys Ser Glu Lys Pro Ala Leu Pro Arg Lys 770 775 780 agg gca ggg gag aac agg tct gac cag gtg acc cga ggc aca gta acg 2400 Arg Ala Gly Glu Asn Arg Ser Asp Gln Val Thr Arg Gly Thr Val Thr 785 790 795 800 cct ccc ccc agg ctg gtg aaa aag aat gag gaa gct gct gat gag gtc 2448 Pro Pro Pro Arg Leu Val Lys Lys Asn Glu Glu Ala Ala Asp Glu Val 805 810 815 ttc aaa gac atc atg gag tcc agc ccg ggc tcc agc ccg ccc aac ctg 2496 Phe Lys Asp Ile Met Glu Ser Ser Pro Gly Ser Ser Pro Pro Asn Leu 820 825 830 act cca aaa ccc ctc cgg cgg cag gtc acc gtg gcc cct gcc tcg ggc 2544 Thr Pro Lys Pro Leu Arg Arg Gln Val Thr Val Ala Pro Ala Ser Gly 835 840 845 ctc ccc cac aag gaa gaa gct gga aag ggc agt gcc tta ggg acc cct 2592 Leu Pro His Lys Glu Glu Ala Gly Lys Gly Ser Ala Leu Gly Thr Pro 850 855 860 gct gca gct gag cca gtg acc ccc acc agc aaa gca ggc tca ggt gca 2640 Ala Ala Ala Glu Pro Val Thr Pro Thr Ser Lys Ala Gly Ser Gly Ala 865 870 875 880 cca ggg ggc acc agc aag ggc ccc gcc gag gag tcc aga gtg agg agg 2688 Pro Gly Gly Thr Ser Lys Gly Pro Ala Glu Glu Ser Arg Val Arg Arg 885 890 895 cac aag cac tcc tct gag tcg cca ggg agg gac aag ggg aaa ttg tcc 2736 His Lys His Ser Ser Glu Ser Pro Gly Arg Asp Lys Gly Lys Leu Ser 900 905 910 agg ctc aaa cct gcc ccg ccg ccc cca cca gca gcc tct gca ggg aag 2784 Arg Leu Lys Pro Ala Pro Pro Pro Pro Pro Ala Ala Ser Ala Gly Lys 915 920 925 gct gga gga aag ccc tcg cag agc ccg agc cag gag gcg gcc ggg gag 2832 Ala Gly Gly Lys Pro Ser Gln Ser Pro Ser Gln Glu Ala Ala Gly Glu 930 935 940 gca gtc ctg ggc gca aag aca aaa gcc acg agt ctg gtt gat gct gtg 2880 Ala Val Leu Gly Ala Lys Thr Lys Ala Thr Ser Leu Val Asp Ala Val 945 950 955 960 aac agt gac gct gcc aag ccc agc cag ccg gga gag ggc ctc aaa aag 2928 Asn Ser Asp Ala Ala Lys Pro Ser Gln Pro Gly Glu Gly Leu Lys Lys 965 970 975 ccc gtg ctc ccg gcc act cca aag cca cag tcc gcc aag ccg tcg ggg 2976 Pro Val Leu Pro Ala Thr Pro Lys Pro Gln Ser Ala Lys Pro Ser Gly 980 985 990 acc ccc atc agc cca gcc ccc gtt ccc tcc acg ttg cca tca gca tcc 3024 Thr Pro Ile Ser Pro Ala Pro Val Pro Ser Thr Leu Pro Ser Ala Ser 995 1000 1005 tcg gcc ctg gca ggg gac cag ccg tct tcc acc gcc ttc atc cct 3069 Ser Ala Leu Ala Gly Asp Gln Pro Ser Ser Thr Ala Phe Ile Pro 1010 1015 1020 ctc ata tca acc cga gtg tct ctt cgg aaa acc cgc cag cct cca 3114 Leu Ile Ser Thr Arg Val Ser Leu Arg Lys Thr Arg Gln Pro Pro 1025 1030 1035 gag cgg atc gcc agc ggc gcc atc acc aag ggc gtg gtc ctg gac 3159 Glu Arg Ile Ala Ser Gly Ala Ile Thr Lys Gly Val Val Leu Asp 1040 1045 1050 agc acc gag gcg ctg tgc ctc gcc atc tct agg aac tcc gag cag 3204 Ser Thr Glu Ala Leu Cys Leu Ala Ile Ser Arg Asn Ser Glu Gln 1055 1060 1065 atg gcc agc cac agc gca gtg ctg gag gcc ggc aaa aac ctc tac 3249 Met Ala Ser His Ser Ala Val Leu Glu Ala Gly Lys Asn Leu Tyr 1070 1075 1080 acg ttc tgc gtg agc tat gtg gat tcc atc cag caa atg agg aac 3294 Thr Phe Cys Val Ser Tyr Val Asp Ser Ile Gln Gln Met Arg Asn 1085 1090 1095 aag ttt gcc ttc cga gag gcc atc aac aaa ctg gag aat aat ctc 3339 Lys Phe Ala Phe Arg Glu Ala Ile Asn Lys Leu Glu Asn Asn Leu 1100 1105 1110 cgg gag ctt cag atc tgc ccg gcg aca gca ggc agt ggt ccg gcg 3384 Arg Glu Leu Gln Ile Cys Pro Ala Thr Ala Gly Ser Gly Pro Ala 1115 1120 1125 gcc act cag gac ttc agc aag ctc ctc agt tcg gtg aag gaa atc 3429 Ala Thr Gln Asp Phe Ser Lys Leu Leu Ser Ser Val Lys Glu Ile 1130 1135 1140 agt gac ata gtg cag agg tag 3450 Ser Asp Ile Val Gln Arg 1145 6 1149 PRT Homo sapiens 6 Met Gly Gln Gln Pro Gly Lys Val Leu Gly Asp Gln Arg Arg Pro Ser 1 5 10 15 Leu Pro Ala Leu His Phe Ile Lys Gly Ala Gly Lys Lys Glu Ser Ser 20 25 30 Arg His Gly Gly Pro His Cys Asn Val Phe Val Glu His Glu Ala Leu 35 40 45 Gln Arg Pro Val Ala Ser Asp Phe Glu Pro Gln Gly Leu Ser Glu Ala 50 55 60 Ala Arg Trp Asn Ser Lys Glu Asn Leu Leu Ala Gly Pro Ser Glu Asn 65 70 75 80 Asp Pro Asn Leu Phe Val Ala Leu Tyr Asp Phe Val Ala Ser Gly Asp 85 90 95 Asn Thr Leu Ser Ile Thr Lys Gly Glu Lys Leu Arg Val Leu Gly Tyr 100 105 110 Asn His Asn Gly Glu Trp Cys Glu Ala Gln Thr Lys Asn Gly Gln Gly 115 120 125 Trp Val Pro Ser Asn Tyr Ile Thr Pro Val Asn Ser Leu Glu Lys His 130 135 140 Ser Trp Tyr His Gly Pro Val Ser Arg Asn Ala Ala Glu Tyr Leu Leu 145 150 155 160 Ser Ser Gly Ile Asn Gly Ser Phe Leu Val Arg Glu Ser Glu Ser Ser 165 170 175 Pro Gly Gln Arg Ser Ile Ser Leu Arg Tyr Glu Gly Arg Val Tyr His 180 185 190 Tyr Arg Ile Asn Thr Ala Ser Asp Gly Lys Leu Tyr Val Ser Ser Glu 195 200 205 Ser Arg Phe Asn Thr Leu Ala Glu Leu Val His His His Ser Thr Val 210 215 220 Ala Asp Gly Leu Ile Thr Thr Leu His Tyr Pro Ala Pro Lys Arg Asn 225 230 235 240 Lys Pro Thr Val Tyr Gly Val Ser Pro Asn Tyr Asp Lys Trp Glu Met 245 250 255 Glu Arg Thr Asp Ile Thr Met Lys His Lys Leu Gly Gly Gly Gln Tyr 260 265 270 Gly Glu Val Tyr Glu Gly Val Trp Lys Lys Tyr Ser Leu Thr Val Ala 275 280 285 Val Lys Thr Leu Lys Glu Asp Thr Met Glu Val Glu Glu Phe Leu Lys 290 295 300 Glu Ala Ala Val Met Lys Glu Ile Lys His Pro Asn Leu Val Gln Leu 305 310 315 320 Leu Gly Val Cys Thr Arg Glu Pro Pro Phe Tyr Ile Ile Thr Glu Phe 325 330 335 Met Thr Tyr Gly Asn Leu Leu Asp Tyr Leu Arg Glu Cys Asn Arg Gln 340 345 350 Glu Val Asn Ala Val Val Leu Leu Tyr Met Ala Thr Gln Ile Ser Ser 355 360 365 Ala Met Glu Tyr Leu Glu Lys Lys Asn Phe Ile His Arg Asp Leu Ala 370 375 380 Ala Arg Asn Cys Leu Val Gly Glu Asn His Leu Val Lys Val Ala Asp 385 390 395 400 Phe Gly Leu Ser Arg Leu Met Thr Gly Asp Thr Tyr Thr Ala His Ala 405 410 415 Gly Ala Lys Phe Pro Ile Lys Trp Thr Ala Pro Glu Ser Leu Ala Tyr 420 425 430 Asn Lys Phe Ser Ile Lys Ser Asp Val Trp Ala Phe Gly Val Leu Leu 435 440 445 Trp Glu Ile Ala Thr Tyr Gly Met Ser Pro Tyr Pro Gly Ile Asp Leu 450 455 460 Ser Gln Val Tyr Glu Leu Leu Glu Lys Asp Tyr Arg Met Glu Arg Pro 465 470 475 480 Glu Gly Cys Pro Glu Lys Val Tyr Glu Leu Met Arg Ala Cys Trp Gln 485 490 495 Trp Asn Pro Ser Asp Arg Pro Ser Phe Ala Glu Ile His Gln Ala Phe 500 505 510 Glu Thr Met Phe Gln Glu Ser Ser Ile Ser Asp Glu Val Glu Lys Glu 515 520 525 Leu Gly Lys Gln Gly Val Arg Gly Ala Val Ser Thr Leu Leu Gln Ala 530 535 540 Pro Glu Leu Pro Thr Lys Thr Arg Thr Ser Arg Arg Ala Ala Glu His 545 550 555 560 Arg Asp Thr Thr Asp Val Pro Glu Met Pro His Ser Lys Gly Gln Gly 565 570 575 Glu Ser Asp Pro Leu Asp His Glu Pro Ala Val Ser Pro Leu Leu Pro 580 585 590 Arg Lys Glu Arg Gly Pro Pro Glu Gly Gly Leu Asn Glu Asp Glu Arg 595 600 605 Leu Leu Pro Lys Asp Lys Lys Thr Asn Leu Phe Ser Ala Leu Ile Lys 610 615 620 Lys Lys Lys Lys Thr Ala Pro Thr Pro Pro Lys Arg Ser Ser Ser Phe 625 630 635 640 Arg Glu Met Asp Gly Gln Pro Glu Arg Arg Gly Ala Gly Glu Glu Glu 645 650 655 Gly Arg Asp Ile Ser Asn Gly Ala Leu Ala Phe Thr Pro Leu Asp Thr 660 665 670 Ala Asp Pro Ala Lys Ser Pro Lys Pro Ser Asn Gly Ala Gly Val Pro 675 680 685 Asn Gly Ala Leu Arg Glu Ser Gly Gly Ser Gly Phe Arg Ser Pro His 690 695 700 Leu Trp Lys Lys Ser Ser Thr Leu Thr Ser Ser Arg Leu Ala Thr Gly 705 710 715 720 Glu Glu Glu Gly Gly Gly Ser Ser Ser Lys Arg Phe Leu Arg Ser Cys 725 730 735 Ser Ala Ser Cys Val Pro His Gly Ala Lys Asp Thr Glu Trp Arg Ser 740 745 750 Val Thr Leu Pro Arg Asp Leu Gln Ser Thr Gly Arg Gln Phe Asp Ser 755 760 765 Ser Thr Phe Gly Gly His Lys Ser Glu Lys Pro Ala Leu Pro Arg Lys 770 775 780 Arg Ala Gly Glu Asn Arg Ser Asp Gln Val Thr Arg Gly Thr Val Thr 785 790 795 800 Pro Pro Pro Arg Leu Val Lys Lys Asn Glu Glu Ala Ala Asp Glu Val 805 810 815 Phe Lys Asp Ile Met Glu Ser Ser Pro Gly Ser Ser Pro Pro Asn Leu 820 825 830 Thr Pro Lys Pro Leu Arg Arg Gln Val Thr Val Ala Pro Ala Ser Gly 835 840 845 Leu Pro His Lys Glu Glu Ala Gly Lys Gly Ser Ala Leu Gly Thr Pro 850 855 860 Ala Ala Ala Glu Pro Val Thr Pro Thr Ser Lys Ala Gly Ser Gly Ala 865 870 875 880 Pro Gly Gly Thr Ser Lys Gly Pro Ala Glu Glu Ser Arg Val Arg Arg 885 890 895 His Lys His Ser Ser Glu Ser Pro Gly Arg Asp Lys Gly Lys Leu Ser 900 905 910 Arg Leu Lys Pro Ala Pro Pro Pro Pro Pro Ala Ala Ser Ala Gly Lys 915 920 925 Ala Gly Gly Lys Pro Ser Gln Ser Pro Ser Gln Glu Ala Ala Gly Glu 930 935 940 Ala Val Leu Gly Ala Lys Thr Lys Ala Thr Ser Leu Val Asp Ala Val 945 950 955 960 Asn Ser Asp Ala Ala Lys Pro Ser Gln Pro Gly Glu Gly Leu Lys Lys 965 970 975 Pro Val Leu Pro Ala Thr Pro Lys Pro Gln Ser Ala Lys Pro Ser Gly 980 985 990 Thr Pro Ile Ser Pro Ala Pro Val Pro Ser Thr Leu Pro Ser Ala Ser 995 1000 1005 Ser Ala Leu Ala Gly Asp Gln Pro Ser Ser Thr Ala Phe Ile Pro 1010 1015 1020 Leu Ile Ser Thr Arg Val Ser Leu Arg Lys Thr Arg Gln Pro Pro 1025 1030 1035 Glu Arg Ile Ala Ser Gly Ala Ile Thr Lys Gly Val Val Leu Asp 1040 1045 1050 Ser Thr Glu Ala Leu Cys Leu Ala Ile Ser Arg Asn Ser Glu Gln 1055 1060 1065 Met Ala Ser His Ser Ala Val Leu Glu Ala Gly Lys Asn Leu Tyr 1070 1075 1080 Thr Phe Cys Val Ser Tyr Val Asp Ser Ile Gln Gln Met Arg Asn 1085 1090 1095 Lys Phe Ala Phe Arg Glu Ala Ile Asn Lys Leu Glu Asn Asn Leu 1100 1105 1110 Arg Glu Leu Gln Ile Cys Pro Ala Thr Ala Gly Ser Gly Pro Ala 1115 1120 1125 Ala Thr Gln Asp Phe Ser Lys Leu Leu Ser Ser Val Lys Glu Ile 1130 1135 1140 Ser Asp Ile Val Gln Arg 1145 7 2982 DNA Homo sapiens CDS (1)..(2979) misc_structure (2053)..(2058) codons 835 and 836 that encode amino acids to help forming the enzymatic site 7 atg ccg gcg ttg gcg cgc gac gcg ggc acc gtg ccg ctg ctc gtt gtt 48 Met Pro Ala Leu Ala Arg Asp Ala Gly Thr Val Pro Leu Leu Val Val 1 5 10 15 ttt tct gca atg ata ttt ggg act att aca aat caa gat ctg cct gtg 96 Phe Ser Ala Met Ile Phe Gly Thr Ile Thr Asn Gln Asp Leu Pro Val 20 25 30 atc aag tgt gtt tta atc aat cat aag aac aat gat tca tca gtg ggg 144 Ile Lys Cys Val Leu Ile Asn His Lys Asn Asn Asp Ser Ser Val Gly 35 40 45 aag tca tca tca tat ccc atg gta tca gaa tcc ccg gaa gac ctc ggg 192 Lys Ser Ser Ser Tyr Pro Met Val Ser Glu Ser Pro Glu Asp Leu Gly 50 55 60 tgt gcg ttg aga ccc cag agc tca ggg aca gtg tac gaa gct gcc gct 240 Cys Ala Leu Arg Pro Gln Ser Ser Gly Thr Val Tyr Glu Ala Ala Ala 65 70 75 80 gtg gaa gtg gat gta tct gct tcc atc aca ctg caa gtg ctg gtc gat 288 Val Glu Val Asp Val Ser Ala Ser Ile Thr Leu Gln Val Leu Val Asp 85 90 95 gcc cca ggg aac att tcc tgt ctc tgg gtc ttt aag cac agc tcc ctg 336 Ala Pro Gly Asn Ile Ser Cys Leu Trp Val Phe Lys His Ser Ser Leu 100 105 110 aat tgc cag cca cat ttt gat tta caa aac aga gga gtt gtt tcc atg 384 Asn Cys Gln Pro His Phe Asp Leu Gln Asn Arg Gly Val Val Ser Met 115 120 125 gtc att ttg aaa atg aca gaa acc caa gct gga gaa tac cta ctt ttt 432 Val Ile Leu Lys Met Thr Glu Thr Gln Ala Gly Glu Tyr Leu Leu Phe 130 135 140 att cag agt gaa gct acc aat tac aca ata ttg ttt aca gtg agt ata 480 Ile Gln Ser Glu Ala Thr Asn Tyr Thr Ile Leu Phe Thr Val Ser Ile 145 150 155 160 aga aat acc ctg ctt tac aca tta aga aga cct tac ttt aga aaa atg 528 Arg Asn Thr Leu Leu Tyr Thr Leu Arg Arg Pro Tyr Phe Arg Lys Met 165 170 175 gaa aac cag gac gcc ctg gtc tgc ata tct gag agc gtt cca gag ccg 576 Glu Asn Gln Asp Ala Leu Val Cys Ile Ser Glu Ser Val Pro Glu Pro 180 185 190 atc gtg gaa tgg gtg ctt tgc gat tca cag ggg gaa agc tgt aaa gaa 624 Ile Val Glu Trp Val Leu Cys Asp Ser Gln Gly Glu Ser Cys Lys Glu 195 200 205 gaa agt cca gct gtt gtt aaa aag gag gaa aaa gtg ctt cat gaa tta 672 Glu Ser Pro Ala Val Val Lys Lys Glu Glu Lys Val Leu His Glu Leu 210 215 220 ttt ggg acg gac ata agg tgc tgt gcc aga aat gaa ctg ggc agg gaa 720 Phe Gly Thr Asp Ile Arg Cys Cys Ala Arg Asn Glu Leu Gly Arg Glu 225 230 235 240 tgc acc agg ctg ttc aca ata gat cta aat caa act cct cag acc aca 768 Cys Thr Arg Leu Phe Thr Ile Asp Leu Asn Gln Thr Pro Gln Thr Thr 245 250 255 ttg cca caa tta ttt ctt aaa gta ggg gaa ccc tta tgg ata agg tgc 816 Leu Pro Gln Leu Phe Leu Lys Val Gly Glu Pro Leu Trp Ile Arg Cys 260 265 270 aaa gct gtt cat gtg aac cat gga ttc ggg ctc acc tgg gaa tta gaa 864 Lys Ala Val His Val Asn His Gly Phe Gly Leu Thr Trp Glu Leu Glu 275 280 285 aac aaa gca ctc gag gag ggc aac tac ttt gag atg agt acc tat tca 912 Asn Lys Ala Leu Glu Glu Gly Asn Tyr Phe Glu Met Ser Thr Tyr Ser 290 295 300 aca aac aga act atg ata cgg att ctg ttt gct ttt gta tca tca gtg 960 Thr Asn Arg Thr Met Ile Arg Ile Leu Phe Ala Phe Val Ser Ser Val 305 310 315 320 gca aga aac gac acc gga tac tac act tgt tcc tct tca aag cat ccc 1008 Ala Arg Asn Asp Thr Gly Tyr Tyr Thr Cys Ser Ser Ser Lys His Pro 325 330 335 agt caa tca gct ttg gtt acc atc gta gga aag gga ttt ata aat gct 1056 Ser Gln Ser Ala Leu Val Thr Ile Val Gly Lys Gly Phe Ile Asn Ala 340 345 350 acc aat tca agt gaa gat tat gaa att gac caa tat gaa gag ttt tgt 1104 Thr Asn Ser Ser Glu Asp Tyr Glu Ile Asp Gln Tyr Glu Glu Phe Cys 355 360 365 ttt tct gtc agg ttt aaa gcc tac cca caa atc aga tgt acg tgg acc 1152 Phe Ser Val Arg Phe Lys Ala Tyr Pro Gln Ile Arg Cys Thr Trp Thr 370 375 380 ttc tct cga aaa tca ttt cct tgt gag caa aag ggt ctt gat aac gga 1200 Phe Ser Arg Lys Ser Phe Pro Cys Glu Gln Lys Gly Leu Asp Asn Gly 385 390 395 400 tac agc ata tcc aag ttt tgc aat cat aag cac cag cca gga gaa tat 1248 Tyr Ser Ile Ser Lys Phe Cys Asn His Lys His Gln Pro Gly Glu Tyr 405 410 415 ata ttc cat gca gaa aat gat gat gcc caa ttt acc aaa atg ttc acg 1296 Ile Phe His Ala Glu Asn Asp Asp Ala Gln Phe Thr Lys Met Phe Thr 420 425 430 ctg aat ata aga agg aaa cct caa gtg ctc gca gaa gca tcg gca agt 1344 Leu Asn Ile Arg Arg Lys Pro Gln Val Leu Ala Glu Ala Ser Ala Ser 435 440 445 cag gcg tcc tgt ttc tcg gat gga tac cca tta cca tct tgg acc tgg 1392 Gln Ala Ser Cys Phe Ser Asp Gly Tyr Pro Leu Pro Ser Trp Thr Trp 450 455 460 aag aag tgt tca gac aag tct ccc aac tgc aca gaa gag atc aca gaa 1440 Lys Lys Cys Ser Asp Lys Ser Pro Asn Cys Thr Glu Glu Ile Thr Glu 465 470 475 480 gga gtc tgg aat aga aag gct aac aga aaa gtg ttt gga cag tgg gtg 1488 Gly Val Trp Asn Arg Lys Ala Asn Arg Lys Val Phe Gly Gln Trp Val 485 490 495 tcg agc agt act cta aac atg agt gaa gcc ata aaa ggg ttc ctg gtc 1536 Ser Ser Ser Thr Leu Asn Met Ser Glu Ala Ile Lys Gly Phe Leu Val 500 505 510 aag tgc tgt gca tac aat tcc ctt ggc aca tct tgt gag acg atc ctt 1584 Lys Cys Cys Ala Tyr Asn Ser Leu Gly Thr Ser Cys Glu Thr Ile Leu 515 520 525 tta aac tct cca ggc ccc ttc cct ttc atc caa gac aac atc tca ttc 1632 Leu Asn Ser Pro Gly Pro Phe Pro Phe Ile Gln Asp Asn Ile Ser Phe 530 535 540 tat gca aca att ggt gtt tgt ctc ctc ttc att gtc gtt tta acc ctg 1680 Tyr Ala Thr Ile Gly Val Cys Leu Leu Phe Ile Val Val Leu Thr Leu 545 550 555 560 cta att tgt cac aag tac aaa aag caa ttt agg tat gaa agc cag cta 1728 Leu Ile Cys His Lys Tyr Lys Lys Gln Phe Arg Tyr Glu Ser Gln Leu 565 570 575 cag atg gta cag gtg acc ggc tcc tca gat aat gag tac ttc tac gtt 1776 Gln Met Val Gln Val Thr Gly Ser Ser Asp Asn Glu Tyr Phe Tyr Val 580 585 590 gat ttc aga gaa tat gaa tat gat ctc aaa tgg gag ttt cca aga gaa 1824 Asp Phe Arg Glu Tyr Glu Tyr Asp Leu Lys Trp Glu Phe Pro Arg Glu 595 600 605 aat tta gag ttt ggg aag gta cta gga tca ggt gct ttt gga aaa gtg 1872 Asn Leu Glu Phe Gly Lys Val Leu Gly Ser Gly Ala Phe Gly Lys Val 610 615 620 atg aac gca aca gct tat gga att agc aaa aca gga gtc tca atc cag 1920 Met Asn Ala Thr Ala Tyr Gly Ile Ser Lys Thr Gly Val Ser Ile Gln 625 630 635 640 gtt gcc gtc aaa atg ctg aaa gaa aaa gca gac agc tct gaa aga gag 1968 Val Ala Val Lys Met Leu Lys Glu Lys Ala Asp Ser Ser Glu Arg Glu 645 650 655 gca ctc atg tca gaa ctc aag atg atg acc cag ctg gga agc cac gag 2016 Ala Leu Met Ser Glu Leu Lys Met Met Thr Gln Leu Gly Ser His Glu 660 665 670 aat att gtg aac ctg ctg ggg gcg tgc aca ctg tca gga cca att tac 2064 Asn Ile Val Asn Leu Leu Gly Ala Cys Thr Leu Ser Gly Pro Ile Tyr 675 680 685 ttg att ttt gaa tac tgt tgc tat ggt gat ctt ctc aac tat cta aga 2112 Leu Ile Phe Glu Tyr Cys Cys Tyr Gly Asp Leu Leu Asn Tyr Leu Arg 690 695 700 agt aaa aga gaa aaa ttt cac agg act tgg aca gag att ttc aag gaa 2160 Ser Lys Arg Glu Lys Phe His Arg Thr Trp Thr Glu Ile Phe Lys Glu 705 710 715 720 cac aat ttc agt ttt tac ccc act ttc caa tca cat cca aat tcc agc 2208 His Asn Phe Ser Phe Tyr Pro Thr Phe Gln Ser His Pro Asn Ser Ser 725 730 735 atg cct ggt tca aga gaa gtt cag ata cac ccg gac tcg gat caa atc 2256 Met Pro Gly Ser Arg Glu Val Gln Ile His Pro Asp Ser Asp Gln Ile 740 745 750 tca ggg ctt cat ggg aat tca ttt cac tct gaa gat gaa att gaa tat 2304 Ser Gly Leu His Gly Asn Ser Phe His Ser Glu Asp Glu Ile Glu Tyr 755 760 765 gaa aac caa aaa agg ctg gaa gaa gag gag gac ttg aat gtg ctt aca 2352 Glu Asn Gln Lys Arg Leu Glu Glu Glu Glu Asp Leu Asn Val Leu Thr 770 775 780 ttt gaa gat ctt ctt tgc ttt gca tat caa gtt gcc aaa gga atg gaa 2400 Phe Glu Asp Leu Leu Cys Phe Ala Tyr Gln Val Ala Lys Gly Met Glu 785 790 795 800 ttt ctg gaa ttt aag tcg tgt gtt cac aga gac ctg gcc gcc agg aac 2448 Phe Leu Glu Phe Lys Ser Cys Val His Arg Asp Leu Ala Ala Arg Asn 805 810 815 gtg ctt gtc acc cac ggg aaa gtg gtg aag ata tgt gac ttt gga ttg 2496 Val Leu Val Thr His Gly Lys Val Val Lys Ile Cys Asp Phe Gly Leu 820 825 830 gct cga gat atc atg agt gat tcc aac tat gtt gtc agg ggc aat gcc 2544 Ala Arg Asp Ile Met Ser Asp Ser Asn Tyr Val Val Arg Gly Asn Ala 835 840 845 cgt ctg cct gta aaa tgg atg gcc ccc gaa agc ctg ttt gaa ggc atc 2592 Arg Leu Pro Val Lys Trp Met Ala Pro Glu Ser Leu Phe Glu Gly Ile 850 855 860 tac acc att aag agt gat gtc tgg tca tat gga ata tta ctg tgg gaa 2640 Tyr Thr Ile Lys Ser Asp Val Trp Ser Tyr Gly Ile Leu Leu Trp Glu 865 870 875 880 atc ttc tca ctt ggt gtg aat cct tac cct ggc att ccg gtt gat gct 2688 Ile Phe Ser Leu Gly Val Asn Pro Tyr Pro Gly Ile Pro Val Asp Ala 885 890 895 aac ttc tac aaa ctg att caa aat gga ttt aaa atg gat cag cca ttt 2736 Asn Phe Tyr Lys Leu Ile Gln Asn Gly Phe Lys Met Asp Gln Pro Phe 900 905 910 tat gct aca gaa gaa ata tac att ata atg caa tcc tgc tgg gct ttt 2784 Tyr Ala Thr Glu Glu Ile Tyr Ile Ile Met Gln Ser Cys Trp Ala Phe 915 920 925 gac tca agg aaa cgg cca tcc ttc cct aat ttg act tcg ttt tta gga 2832 Asp Ser Arg Lys Arg Pro Ser Phe Pro Asn Leu Thr Ser Phe Leu Gly 930 935 940 tgt cag ctg gca gat gca gaa gaa gcg atg tat cag aat gtg gat ggc 2880 Cys Gln Leu Ala Asp Ala Glu Glu Ala Met Tyr Gln Asn Val Asp Gly 945 950 955 960 cgt gtt tcg gaa tgt cct cac acc tac caa aac agg cga cct ttc agc 2928 Arg Val Ser Glu Cys Pro His Thr Tyr Gln Asn Arg Arg Pro Phe Ser 965 970 975 aga gag atg gat ttg ggg cta ctc tct ccg cag gct cag gtc gaa gat 2976 Arg Glu Met Asp Leu Gly Leu Leu Ser Pro Gln Ala Gln Val Glu Asp 980 985 990 tcg tag 2982 Ser 8 993 PRT Homo sapiens 8 Met Pro Ala Leu Ala Arg Asp Ala Gly Thr Val Pro Leu Leu Val Val 1 5 10 15 Phe Ser Ala Met Ile Phe Gly Thr Ile Thr Asn Gln Asp Leu Pro Val 20 25 30 Ile Lys Cys Val Leu Ile Asn His Lys Asn Asn Asp Ser Ser Val Gly 35 40 45 Lys Ser Ser Ser Tyr Pro Met Val Ser Glu Ser Pro Glu Asp Leu Gly 50 55 60 Cys Ala Leu Arg Pro Gln Ser Ser Gly Thr Val Tyr Glu Ala Ala Ala 65 70 75 80 Val Glu Val Asp Val Ser Ala Ser Ile Thr Leu Gln Val Leu Val Asp 85 90 95 Ala Pro Gly Asn Ile Ser Cys Leu Trp Val Phe Lys His Ser Ser Leu 100 105 110 Asn Cys Gln Pro His Phe Asp Leu Gln Asn Arg Gly Val Val Ser Met 115 120 125 Val Ile Leu Lys Met Thr Glu Thr Gln Ala Gly Glu Tyr Leu Leu Phe 130 135 140 Ile Gln Ser Glu Ala Thr Asn Tyr Thr Ile Leu Phe Thr Val Ser Ile 145 150 155 160 Arg Asn Thr Leu Leu Tyr Thr Leu Arg Arg Pro Tyr Phe Arg Lys Met 165 170 175 Glu Asn Gln Asp Ala Leu Val Cys Ile Ser Glu Ser Val Pro Glu Pro 180 185 190 Ile Val Glu Trp Val Leu Cys Asp Ser Gln Gly Glu Ser Cys Lys Glu 195 200 205 Glu Ser Pro Ala Val Val Lys Lys Glu Glu Lys Val Leu His Glu Leu 210 215 220 Phe Gly Thr Asp Ile Arg Cys Cys Ala Arg Asn Glu Leu Gly Arg Glu 225 230 235 240 Cys Thr Arg Leu Phe Thr Ile Asp Leu Asn Gln Thr Pro Gln Thr Thr 245 250 255 Leu Pro Gln Leu Phe Leu Lys Val Gly Glu Pro Leu Trp Ile Arg Cys 260 265 270 Lys Ala Val His Val Asn His Gly Phe Gly Leu Thr Trp Glu Leu Glu 275 280 285 Asn Lys Ala Leu Glu Glu Gly Asn Tyr Phe Glu Met Ser Thr Tyr Ser 290 295 300 Thr Asn Arg Thr Met Ile Arg Ile Leu Phe Ala Phe Val Ser Ser Val 305 310 315 320 Ala Arg Asn Asp Thr Gly Tyr Tyr Thr Cys Ser Ser Ser Lys His Pro 325 330 335 Ser Gln Ser Ala Leu Val Thr Ile Val Gly Lys Gly Phe Ile Asn Ala 340 345 350 Thr Asn Ser Ser Glu Asp Tyr Glu Ile Asp Gln Tyr Glu Glu Phe Cys 355 360 365 Phe Ser Val Arg Phe Lys Ala Tyr Pro Gln Ile Arg Cys Thr Trp Thr 370 375 380 Phe Ser Arg Lys Ser Phe Pro Cys Glu Gln Lys Gly Leu Asp Asn Gly 385 390 395 400 Tyr Ser Ile Ser Lys Phe Cys Asn His Lys His Gln Pro Gly Glu Tyr 405 410 415 Ile Phe His Ala Glu Asn Asp Asp Ala Gln Phe Thr Lys Met Phe Thr 420 425 430 Leu Asn Ile Arg Arg Lys Pro Gln Val Leu Ala Glu Ala Ser Ala Ser 435 440 445 Gln Ala Ser Cys Phe Ser Asp Gly Tyr Pro Leu Pro Ser Trp Thr Trp 450 455 460 Lys Lys Cys Ser Asp Lys Ser Pro Asn Cys Thr Glu Glu Ile Thr Glu 465 470 475 480 Gly Val Trp Asn Arg Lys Ala Asn Arg Lys Val Phe Gly Gln Trp Val 485 490 495 Ser Ser Ser Thr Leu Asn Met Ser Glu Ala Ile Lys Gly Phe Leu Val 500 505 510 Lys Cys Cys Ala Tyr Asn Ser Leu Gly Thr Ser Cys Glu Thr Ile Leu 515 520 525 Leu Asn Ser Pro Gly Pro Phe Pro Phe Ile Gln Asp Asn Ile Ser Phe 530 535 540 Tyr Ala Thr Ile Gly Val Cys Leu Leu Phe Ile Val Val Leu Thr Leu 545 550 555 560 Leu Ile Cys His Lys Tyr Lys Lys Gln Phe Arg Tyr Glu Ser Gln Leu 565 570 575 Gln Met Val Gln Val Thr Gly Ser Ser Asp Asn Glu Tyr Phe Tyr Val 580 585 590 Asp Phe Arg Glu Tyr Glu Tyr Asp Leu Lys Trp Glu Phe Pro Arg Glu 595 600 605 Asn Leu Glu Phe Gly Lys Val Leu Gly Ser Gly Ala Phe Gly Lys Val 610 615 620 Met Asn Ala Thr Ala Tyr Gly Ile Ser Lys Thr Gly Val Ser Ile Gln 625 630 635 640 Val Ala Val Lys Met Leu Lys Glu Lys Ala Asp Ser Ser Glu Arg Glu 645 650 655 Ala Leu Met Ser Glu Leu Lys Met Met Thr Gln Leu Gly Ser His Glu 660 665 670 Asn Ile Val Asn Leu Leu Gly Ala Cys Thr Leu Ser Gly Pro Ile Tyr 675 680 685 Leu Ile Phe Glu Tyr Cys Cys Tyr Gly Asp Leu Leu Asn Tyr Leu Arg 690 695 700 Ser Lys Arg Glu Lys Phe His Arg Thr Trp Thr Glu Ile Phe Lys Glu 705 710 715 720 His Asn Phe Ser Phe Tyr Pro Thr Phe Gln Ser His Pro Asn Ser Ser 725 730 735 Met Pro Gly Ser Arg Glu Val Gln Ile His Pro Asp Ser Asp Gln Ile 740 745 750 Ser Gly Leu His Gly Asn Ser Phe His Ser Glu Asp Glu Ile Glu Tyr 755 760 765 Glu Asn Gln Lys Arg Leu Glu Glu Glu Glu Asp Leu Asn Val Leu Thr 770 775 780 Phe Glu Asp Leu Leu Cys Phe Ala Tyr Gln Val Ala Lys Gly Met Glu 785 790 795 800 Phe Leu Glu Phe Lys Ser Cys Val His Arg Asp Leu Ala Ala Arg Asn 805 810 815 Val Leu Val Thr His Gly Lys Val Val Lys Ile Cys Asp Phe Gly Leu 820 825 830 Ala Arg Asp Ile Met Ser Asp Ser Asn Tyr Val Val Arg Gly Asn Ala 835 840 845 Arg Leu Pro Val Lys Trp Met Ala Pro Glu Ser Leu Phe Glu Gly Ile 850 855 860 Tyr Thr Ile Lys Ser Asp Val Trp Ser Tyr Gly Ile Leu Leu Trp Glu 865 870 875 880 Ile Phe Ser Leu Gly Val Asn Pro Tyr Pro Gly Ile Pro Val Asp Ala 885 890 895 Asn Phe Tyr Lys Leu Ile Gln Asn Gly Phe Lys Met Asp Gln Pro Phe 900 905 910 Tyr Ala Thr Glu Glu Ile Tyr Ile Ile Met Gln Ser Cys Trp Ala Phe 915 920 925 Asp Ser Arg Lys Arg Pro Ser Phe Pro Asn Leu Thr Ser Phe Leu Gly 930 935 940 Cys Gln Leu Ala Asp Ala Glu Glu Ala Met Tyr Gln Asn Val Asp Gly 945 950 955 960 Arg Val Ser Glu Cys Pro His Thr Tyr Gln Asn Arg Arg Pro Phe Ser 965 970 975 Arg Glu Met Asp Leu Gly Leu Leu Ser Pro Gln Ala Gln Val Glu Asp 980 985 990 Ser 

I claim:
 1. A method for testing the interaction between a drug and an enzyme from a tumor of a patient, the method comprising the steps of: (a) obtaining from a patient a sample of tumor cells producing an enzyme to be tested for the ability of a drug to interact with the enzyme; (b) determining the allele of the enzyme by analyzing a biomolecule from the tumor cells, wherein the biomolecule is selected from the group consisting of a nucleotide sequence encoding an enzymatic site of the enzyme, a nucleotide sequence encoding a fragment of the enzymatic site of the enzyme, a polypeptide fragment from the enzymatic site of the enzyme, and the enzyme; and (c) comparing the allele of the enzyme to a reference to determine the ability of the drug to interact with the enzyme.
 2. The method of claim 1 further comprising the step of testing the tumor cells to identify the phosphorylation state of the enzyme.
 3. The method of claim 1 further comprising the step of placing the enzyme in direct contact with the drug to further test the sensitivity of the enzyme to the drug.
 4. The method of claim 1 wherein the reference is either a reference biomolecule, a reference database, or a control.
 5. The method of claim 4 wherein the reference biomolecule is selected from the group consisting of a nucleotide sequence encoding an enzymatic site of a reference enzyme known to interact with the drug, a nucleotide sequence encoding a fragment of the enzymatic site of the reference enzyme, a polypeptide fragment from the enzymatic site of the reference enzyme, the reference enzyme, a polyclonal antibody, a monoclonal antibody and a probe directed to the reference enzyme.
 6. The method of claim 5 wherein the enzymatic site comprises a portion of the activation loop for the reference enzyme.
 7. The method of claim 4 wherein the reference biomolecule is selected from the group consisting of a nucleotide sequence encoding an enzymatic site of a c-KIT reference enzyme known to interact with the drug, a nucleotide sequence encoding a fragment of the enzymatic site of the c-KIT reference enzyme, a polypeptide fragment from the enzymatic site of the c-KIT reference enzyme, the c-KIT reference enzyme, a polyclonal antibody, a monoclonal antibody and a probe directed to the c-KIT reference enzyme.
 8. The method of claim 1 wherein the allele is determined by using either a gene chip, a probe, a monoclonal antibody, a polyclonal antibody, by sequencing the biomolecule, or by performing a nucleotide polymorphism analysis.
 9. A method for testing the interaction between a drug and an enzyme from a tumor of a patient, the method comprising the steps of: (a) obtaining from a patient a sample of tumor cells producing an enzyme to be tested for the ability of a drug to interact with the enzyme; (b) determining the allele of the enzyme by analyzing a biomolecule from the tumor cells, wherein the biomolecule is selected from the group consisting of a nucleotide sequence encoding an enzymatic site of the enzyme, a nucleotide sequence encoding a fragment of the enzymatic site of the enzyme, a polypeptide fragment from the enzymatic site of the enzyme, and the enzyme; and (c) comparing the allele of the enzyme to a reference biomolecule to determine the ability of the drug to interact with the enzyme, wherein the reference biomolecule is selected from the group consisting of a nucleotide sequence encoding an enzymatic site of a reference enzyme known to interact with the drug, a nucleotide sequence encoding a fragment of the enzymatic site of the reference enzyme, a polypeptide fragment from the enzymatic site of the reference enzyme, the reference enzyme, a polyclonal antibody, a monoclonal antibody and a probe.
 10. The method of claim 9 further comprising the step of testing the tumor cells to identify the phosphorylation state of the enzyme.
 11. The method of claim 9 further comprising the step of placing the enzyme in direct contact with the drug to further test the sensitivity of the enzyme to the drug.
 12. The method of claim 9 wherein the allele is determined by using either a gene chip, a probe, a monoclonal antibody, a polyclonal antibody, by sequencing the biomolecule, or by performing a nucleotide polymorphism analysis.
 13. The method of claim 9 wherein the reference biomolecule is selected from the group consisting of a nucleotide sequence encoding an enzymatic site of a c-KIT reference enzyme known to interact with the drug, a nucleotide sequence encoding a fragment of the enzymatic site of the c-KIT reference enzyme, a polypeptide fragment from the enzymatic site of the c-KIT reference enzyme, the c-KIT reference enzyme, a polyclonal antibody, a monoclonal antibody and a probe directed to the c-KIT reference enzyme.
 14. The method of claim 13 wherein the enzymatic site comprises a portion of the activation loop for the reference enzyme.
 15. A kit for testing the interaction between a drug and an enzyme from a tumor of a patient, the kit comprising a research tool for determining the allele of the enzyme and a reference for comparison against the allele of the enzyme to determine the ability of the drug to interact with the enzyme.
 16. The kit of claim 15 wherein the research tool determines the allele of the enzyme by analyzing a biomolecule from the tumor, wherein the biomolecule is selected from the group consisting of a nucleotide sequence encoding an enzymatic site of the enzyme, a nucleotide sequence encoding a fragment of the enzymatic site of the enzyme, a polypeptide fragment from the enzymatic site of the enzyme, and the enzyme.
 17. The kit of claim 15 wherein the research tool is either a gene chip, a probe, a monoclonal antibody, a polyclonal antibody, or materials for sequencing the biomolecule or performing a nucleotide polymorphism analysis.
 18. The kit of claim 15 wherein the reference is either a reference biomolecule, a reference database, or a control.
 19. The kit of claim 18 wherein the reference biomolecule is selected from the group consisting of a nucleotide sequence encoding an enzymatic site of a reference enzyme known to interact with the drug, a nucleotide sequence encoding a fragment of the enzymatic site of the reference enzyme, a polypeptide fragment from the enzymatic site of the reference enzyme, the reference enzyme, a polyclonal antibody, a monoclonal antibody and a probe.
 20. The kit of claim 18 wherein the reference biomolecule is selected from the group consisting of a nucleotide sequence encoding an enzymatic site of a c-KIT reference enzyme known to interact with the drug, a nucleotide sequence encoding a fragment of the enzymatic site of the c-KIT reference enzyme, a polypeptide fragment from the enzymatic site of the c-KIT reference enzyme, the c-KIT reference enzyme, a polyclonal antibody, a monoclonal antibody and a probe directed to the c-KIT reference enzyme.
 21. The method of claim 20 wherein the enzymatic site comprises a portion of the activation loop for the reference enzyme.
 22. A method for testing the interaction between a drug and a c-KIT target enzyme from a tumor of a patient, the method comprising the steps of: (a) obtaining from a patient a sample of tumor cells producing a c-KIT target enzyme; (b) determining the allele of the c-KIT target enzyme by analyzing a biomolecule from the tumor cells, wherein the biomolecule is selected from the group consisting of a nucleotide sequence encoding an enzymatic site of the c-KIT target enzyme, a nucleotide sequence encoding a fragment of the c-KIT target enzymatic site of the enzyme, a polypeptide fragment from the enzymatic site of the c-KIT target enzyme, and the c-KIT target enzyme; and (c) comparing the allele of the c-KIT target enzyme to a reference to determine the ability of the drug to interact with the c-KIT target enzyme.
 23. The method of claim 22 further comprising the step of testing the tumor cells to identify the phosphorylation state of the c-KIT target enzyme.
 24. The method of claim 22 further comprising the step of placing the c-KIT target enzyme in direct contact with the drug to further test the sensitivity of the c-KIT target enzyme to the drug.
 25. The method of claim 22 wherein the reference is either a reference biomolecule, a reference database, or a control.
 26. The method of claim 25 wherein the reference biomolecule is selected from the group consisting of a nucleotide sequence encoding an enzymatic site of a c-KIT reference enzyme known to interact with the drug, a nucleotide sequence encoding a fragment of the enzymatic site of the c-KIT reference enzyme, a polypeptide fragment from the enzymatic site of the c-KIT reference enzyme, the c-KIT reference enzyme, a polyclonal antibody, a monoclonal antibody and a probe directed to the c-KIT reference enzyme.
 27. The method of claim 25 wherein the enzymatic site comprises a portion of the activation loop for the reference enzyme.
 28. The method of claim 22 wherein the allele is determined by using either a gene chip, a probe, a monoclonal antibody, a polyclonal antibody, by sequencing the biomolecule, or by performing a nucleotide polymorphism analysis.
 29. A method for testing the interaction between a drug and a c-KIT target enzyme from a tumor of a patient, the method comprising the steps of: (a) obtaining from a patient a sample of tumor cells producing a c-KIT target enzyme; (b) determining the c-KIT target allele of the enzyme by analyzing a biomolecule from the tumor cells, wherein the biomolecule is selected from the group consisting of a nucleotide sequence encoding an enzymatic site of the c-KIT target enzyme, a nucleotide sequence encoding a fragment of the enzymatic site of the c-KIT target enzyme, a polypeptide fragment from the enzymatic site of the c-KIT target enzyme, and the c-KIT target enzyme; and (c) comparing the allele of the enzyme to a reference biomolecule to determine the ability of the drug to interact with the c-KIT target enzyme, wherein the reference biomolecule is selected from the group consisting of a nucleotide sequence encoding an enzymatic site of a c-KIT reference enzyme known to interact with the drug, a nucleotide sequence encoding a fragment of the enzymatic site of the c-KIT reference enzyme, a polypeptide fragment from the enzymatic site of the c-KIT reference enzyme, the c-KIT reference enzyme, a polyclonal antibody, a monoclonal antibody and a probe directed to the c-KIT reference enzyme.
 30. The method of claim 29 further comprising the step of testing the tumor cells to identify the phosphorylation state of the c-KIT target enzyme.
 31. The method of claim 29 further comprising the step of placing the c-KIT target enzyme in direct contact with the drug to further test the sensitivity of the c-KIT target enzyme to the drug.
 32. The method of claim 29 wherein the allele is determined by using either a gene chip, a probe, a monoclonal antibody, a polyclonal antibody, by sequencing the biomolecule, or by performing a nucleotide polymorphism analysis. 